Development and Validation of a Machine Learning-Based Risk Prediction Model for Postoperative Delirium in Older Patients with Hip Fracture

The Journals of Gerontology Series A: Biological Sciences and Medical Sciences Pub Date : 2025-09-20 DOI:10.1093/gerona/glaf200

Weili Zhang, Nan Tang, Jie Song, Mi Song, Qingqing Su, Xiaojie Fu, Yuan Gao

{"title":"Development and Validation of a Machine Learning-Based Risk Prediction Model for Postoperative Delirium in Older Patients with Hip Fracture","authors":"Weili Zhang, Nan Tang, Jie Song, Mi Song, Qingqing Su, Xiaojie Fu, Yuan Gao","doi":"10.1093/gerona/glaf200","DOIUrl":null,"url":null,"abstract":"Background Postoperative delirium (POD) is associated with impaired cognitive function, increased morbidity, and mortality. Early identification of high-risk patients is critical for effective intervention. Methods Data from 2,516 older patients with hip fractures treated at the First Medical Center of the Chinese PLA General Hospital were retrospectively collected. Logistic Regression (LR), Random Forest (RF), Classification and Regression Tree (CART), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost) were used to construct the prediction models. SHapley Additive exPlanation (SHAP) analysis was performed to visualize the optimal model. External validation was conducted on 176 patients from March 2022 to November 2023 to assess the model's clinical applicability. Results The training dataset included 2,516 older patients, of which 367 (14.59%) developed POD. XGBoost demonstrated the best predictive performance (AUC = 0.92; accuracy = 86.4%; sensitivity = 87.7%; specificity = 85.1%; Brier score = 0.15). SHAP analysis ranked PNI (Prognostic Nutritional Index), ASA (American Society of Anesthesiologists classification), and age as the top three predictors. External validation on 176 patients showed the XGBoost model maintained strong performance (AUC = 0.89; accuracy = 83.0%; sensitivity = 95.8%; specificity = 80.9%; Brier score = 0.15). Conclusions An ML-based model was developed and validated to predict postoperative delirium risk in older patients with hip fracture. These findings may help to develop personalized interventions to provide better treatment plans and optimal resource allocation. The interpretable framework can increase the transparency of the model and facilitate understanding the reliability of the predictive model for the physicians.","PeriodicalId":22892,"journal":{"name":"The Journals of Gerontology Series A: Biological Sciences and Medical Sciences","volume":"18 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Journals of Gerontology Series A: Biological Sciences and Medical Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/gerona/glaf200","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background Postoperative delirium (POD) is associated with impaired cognitive function, increased morbidity, and mortality. Early identification of high-risk patients is critical for effective intervention. Methods Data from 2,516 older patients with hip fractures treated at the First Medical Center of the Chinese PLA General Hospital were retrospectively collected. Logistic Regression (LR), Random Forest (RF), Classification and Regression Tree (CART), Support Vector Machine (SVM), and Extreme Gradient Boosting (XGBoost) were used to construct the prediction models. SHapley Additive exPlanation (SHAP) analysis was performed to visualize the optimal model. External validation was conducted on 176 patients from March 2022 to November 2023 to assess the model's clinical applicability. Results The training dataset included 2,516 older patients, of which 367 (14.59%) developed POD. XGBoost demonstrated the best predictive performance (AUC = 0.92; accuracy = 86.4%; sensitivity = 87.7%; specificity = 85.1%; Brier score = 0.15). SHAP analysis ranked PNI (Prognostic Nutritional Index), ASA (American Society of Anesthesiologists classification), and age as the top three predictors. External validation on 176 patients showed the XGBoost model maintained strong performance (AUC = 0.89; accuracy = 83.0%; sensitivity = 95.8%; specificity = 80.9%; Brier score = 0.15). Conclusions An ML-based model was developed and validated to predict postoperative delirium risk in older patients with hip fracture. These findings may help to develop personalized interventions to provide better treatment plans and optimal resource allocation. The interpretable framework can increase the transparency of the model and facilitate understanding the reliability of the predictive model for the physicians.

查看原文本刊更多论文

基于机器学习的老年髋部骨折患者术后谵妄风险预测模型的开发与验证

背景术后谵妄（POD）与认知功能受损、发病率和死亡率增加有关。早期识别高危患者对有效干预至关重要。方法回顾性分析解放军总医院第一医疗中心收治的2516例老年髋部骨折患者的资料。采用Logistic回归（LR）、随机森林（RF）、分类与回归树（CART）、支持向量机（SVM）和极端梯度增强（XGBoost）等方法构建预测模型。采用SHapley加性解释（SHAP）分析可视化优化模型。从2022年3月到2023年11月，对176例患者进行了外部验证，以评估该模型的临床适用性。结果训练数据集包括2516例老年患者，其中367例（14.59%）发生POD。XGBoost的预测效果最佳（AUC = 0.92，准确率= 86.4%，灵敏度= 87.7%，特异性= 85.1%，Brier评分= 0.15）。SHAP分析将PNI（预后营养指数）、ASA（美国麻醉医师学会分类）和年龄列为前三大预测因素。对176例患者的外部验证表明，XGBoost模型保持了较好的性能（AUC = 0.89，准确率= 83.0%，灵敏度= 95.8%，特异性= 80.9%，Brier评分= 0.15）。结论建立了一种基于ml的模型，并验证了该模型对老年髋部骨折患者术后谵妄风险的预测。这些发现可能有助于制定个性化的干预措施，以提供更好的治疗计划和最佳的资源分配。可解释的框架可以增加模型的透明度，便于医生理解预测模型的可靠性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The Journals of Gerontology Series A: Biological Sciences and Medical Sciences

自引率

0.00%

发文量