Yingying Lin , Jingqi Gao , Linfang Chen , Yixiao Hong , Min Li , Peiling Chen , Xiuling Shang
{"title":"An interpretable XGBoost model for risk prediction of progression from sepsis-associated acute kidney injury to chronic kidney disease","authors":"Yingying Lin , Jingqi Gao , Linfang Chen , Yixiao Hong , Min Li , Peiling Chen , Xiuling Shang","doi":"10.1016/j.imu.2025.101685","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>To develop an interpretable machine learning (ML) model for predicting the risk of progression from sepsis-associated acute kidney injury (SA-AKI) to chronic kidney disease (CKD) guiding early stratified interventions.</div></div><div><h3>Methods</h3><div>Using data from 1315 SA-AKI patients [Medical Information Mart for Intensive Care IV (MIMIC-IV) database], we constructed an extreme gradient boosting (XGBoost) model with SHapley Additive exPlanations (SHAP) interpretability. Performance was evaluated by discrimination, calibration, and clinical utility [decision curve analysis (DCA)].</div></div><div><h3>Results</h3><div>CKD incidence was 36.7 % (median onset: 7.6 months). The XGBoost model achieved: superior discrimination [training area under the curve (AUC) 0.920; validation AUC 0.951 versus Sequential Organ Failure Assessment (SOFA) renal 0.616 and logistic regression (LR) 0.822], robust calibration, and clinical applicability. SHAP identified actionable thresholds (age >65, maximum serum creatinine >0.9 mg/dl) for early intervention. Feature stability analysis revealed a stage-dependent coefficient drift for serum creatinine (Δβ = +0.84), reflecting dynamic pathophysiology. Crucially, the model provides clinically interpretable outputs without requiring SHAP expertise, enabling seamless integration into workflows.</div></div><div><h3>Conclusion</h3><div>Our model delivers personalized, interpretable CKD risk alerts for SA-AKI patients, empowering clinicians to stratify follow-up care. External validation is warranted to confirm generalizability.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"58 ","pages":"Article 101685"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914825000747","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
To develop an interpretable machine learning (ML) model for predicting the risk of progression from sepsis-associated acute kidney injury (SA-AKI) to chronic kidney disease (CKD) guiding early stratified interventions.
Methods
Using data from 1315 SA-AKI patients [Medical Information Mart for Intensive Care IV (MIMIC-IV) database], we constructed an extreme gradient boosting (XGBoost) model with SHapley Additive exPlanations (SHAP) interpretability. Performance was evaluated by discrimination, calibration, and clinical utility [decision curve analysis (DCA)].
Results
CKD incidence was 36.7 % (median onset: 7.6 months). The XGBoost model achieved: superior discrimination [training area under the curve (AUC) 0.920; validation AUC 0.951 versus Sequential Organ Failure Assessment (SOFA) renal 0.616 and logistic regression (LR) 0.822], robust calibration, and clinical applicability. SHAP identified actionable thresholds (age >65, maximum serum creatinine >0.9 mg/dl) for early intervention. Feature stability analysis revealed a stage-dependent coefficient drift for serum creatinine (Δβ = +0.84), reflecting dynamic pathophysiology. Crucially, the model provides clinically interpretable outputs without requiring SHAP expertise, enabling seamless integration into workflows.
Conclusion
Our model delivers personalized, interpretable CKD risk alerts for SA-AKI patients, empowering clinicians to stratify follow-up care. External validation is warranted to confirm generalizability.
期刊介绍:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.