PeiYang Wang, Lei Liu, ZhiYang Xie, GuanRui Ren, YiLi Hu, MeiJi Shen, Hui Wang, JiaDong Wang, YunTao Wang, Xiao-Tao Wu
{"title":"基于SHAP预测后路腰椎融合术后手术部位感染的可解释机器学习模型。","authors":"PeiYang Wang, Lei Liu, ZhiYang Xie, GuanRui Ren, YiLi Hu, MeiJi Shen, Hui Wang, JiaDong Wang, YunTao Wang, Xiao-Tao Wu","doi":"10.1016/j.wneu.2025.123942","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This study aims to develop machine learning (ML) models combined with an explainable method for the prediction of surgical site infection (SSI) after posterior lumbar fusion surgery.</div></div><div><h3>Methods</h3><div>In this retrospective, single-center study, a total of 1016 consecutive patients who underwent posterior lumbar fusion surgery were included. A comprehensive dataset was established, encompassing demographic variables, comorbidities, preoperative evaluation, details related to diagnosed lumbar disease, preoperative laboratory tests, surgical specifics, and postoperative factors. Utilizing this dataset, 6nullML models were developed to predict the occurrence of SSI. Performance evaluation of the models on the testing set involved several metrics, including the receiver operating characteristic curve, the area under the receiver operating characteristic curve, accuracy, recall, F1 score, and precision. The Shapley Additive Explanations (SHAP) method was employed to generate interpretable predictions, enabling a comprehensive assessment of SSI risk and providing individualized interpretations of the model results.</div></div><div><h3>Results</h3><div>Among the 1016 retrospective cases included in the study, 36 (3.54%) experienced SSI. Out of the six models examined, the Extreme Gradient Boost model demonstrated the highest discriminatory performance on the testing set, achieving the following metrics: precision (0.9000), recall (0.8182), accuracy (0.9902), F1 score (0.8571), and area under the receiver operating characteristic curve (0.9447). By utilizing the SHAP method, several important predictors of SSI were identified, including the duration of indwelling jugular vein catheter, blood urea nitrogen levels, total protein levels, sustained fever, creatinine levels, triglycerides levels, monocyte count, diabetes mellitus, drainage time, white blood cell count, cerebral infarction, estimated blood loss, prealbumin levels, Prognostic Nutritional Index, low back pain, posterior fusion score, and osteoporosis.</div></div><div><h3>Conclusions</h3><div>ML-based prediction tools can accurately assess the risk of SSI after posterior lumbar fusion surgery. Additionally, ML combined with SHAP could provide a clear interpretation of individualized risk prediction and give physicians an intuitive comprehension of the effects of the model's essential features.</div></div>","PeriodicalId":23906,"journal":{"name":"World neurosurgery","volume":"197 ","pages":"Article 123942"},"PeriodicalIF":1.9000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Explainable Machine Learning Models for Prediction of Surgical Site Infection After Posterior Lumbar Fusion Surgery Based on Shapley Additive Explanations\",\"authors\":\"PeiYang Wang, Lei Liu, ZhiYang Xie, GuanRui Ren, YiLi Hu, MeiJi Shen, Hui Wang, JiaDong Wang, YunTao Wang, Xiao-Tao Wu\",\"doi\":\"10.1016/j.wneu.2025.123942\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>This study aims to develop machine learning (ML) models combined with an explainable method for the prediction of surgical site infection (SSI) after posterior lumbar fusion surgery.</div></div><div><h3>Methods</h3><div>In this retrospective, single-center study, a total of 1016 consecutive patients who underwent posterior lumbar fusion surgery were included. A comprehensive dataset was established, encompassing demographic variables, comorbidities, preoperative evaluation, details related to diagnosed lumbar disease, preoperative laboratory tests, surgical specifics, and postoperative factors. Utilizing this dataset, 6nullML models were developed to predict the occurrence of SSI. Performance evaluation of the models on the testing set involved several metrics, including the receiver operating characteristic curve, the area under the receiver operating characteristic curve, accuracy, recall, F1 score, and precision. The Shapley Additive Explanations (SHAP) method was employed to generate interpretable predictions, enabling a comprehensive assessment of SSI risk and providing individualized interpretations of the model results.</div></div><div><h3>Results</h3><div>Among the 1016 retrospective cases included in the study, 36 (3.54%) experienced SSI. Out of the six models examined, the Extreme Gradient Boost model demonstrated the highest discriminatory performance on the testing set, achieving the following metrics: precision (0.9000), recall (0.8182), accuracy (0.9902), F1 score (0.8571), and area under the receiver operating characteristic curve (0.9447). By utilizing the SHAP method, several important predictors of SSI were identified, including the duration of indwelling jugular vein catheter, blood urea nitrogen levels, total protein levels, sustained fever, creatinine levels, triglycerides levels, monocyte count, diabetes mellitus, drainage time, white blood cell count, cerebral infarction, estimated blood loss, prealbumin levels, Prognostic Nutritional Index, low back pain, posterior fusion score, and osteoporosis.</div></div><div><h3>Conclusions</h3><div>ML-based prediction tools can accurately assess the risk of SSI after posterior lumbar fusion surgery. Additionally, ML combined with SHAP could provide a clear interpretation of individualized risk prediction and give physicians an intuitive comprehension of the effects of the model's essential features.</div></div>\",\"PeriodicalId\":23906,\"journal\":{\"name\":\"World neurosurgery\",\"volume\":\"197 \",\"pages\":\"Article 123942\"},\"PeriodicalIF\":1.9000,\"publicationDate\":\"2025-03-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"World neurosurgery\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1878875025002980\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CLINICAL NEUROLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"World neurosurgery","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1878875025002980","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
Explainable Machine Learning Models for Prediction of Surgical Site Infection After Posterior Lumbar Fusion Surgery Based on Shapley Additive Explanations
Objective
This study aims to develop machine learning (ML) models combined with an explainable method for the prediction of surgical site infection (SSI) after posterior lumbar fusion surgery.
Methods
In this retrospective, single-center study, a total of 1016 consecutive patients who underwent posterior lumbar fusion surgery were included. A comprehensive dataset was established, encompassing demographic variables, comorbidities, preoperative evaluation, details related to diagnosed lumbar disease, preoperative laboratory tests, surgical specifics, and postoperative factors. Utilizing this dataset, 6nullML models were developed to predict the occurrence of SSI. Performance evaluation of the models on the testing set involved several metrics, including the receiver operating characteristic curve, the area under the receiver operating characteristic curve, accuracy, recall, F1 score, and precision. The Shapley Additive Explanations (SHAP) method was employed to generate interpretable predictions, enabling a comprehensive assessment of SSI risk and providing individualized interpretations of the model results.
Results
Among the 1016 retrospective cases included in the study, 36 (3.54%) experienced SSI. Out of the six models examined, the Extreme Gradient Boost model demonstrated the highest discriminatory performance on the testing set, achieving the following metrics: precision (0.9000), recall (0.8182), accuracy (0.9902), F1 score (0.8571), and area under the receiver operating characteristic curve (0.9447). By utilizing the SHAP method, several important predictors of SSI were identified, including the duration of indwelling jugular vein catheter, blood urea nitrogen levels, total protein levels, sustained fever, creatinine levels, triglycerides levels, monocyte count, diabetes mellitus, drainage time, white blood cell count, cerebral infarction, estimated blood loss, prealbumin levels, Prognostic Nutritional Index, low back pain, posterior fusion score, and osteoporosis.
Conclusions
ML-based prediction tools can accurately assess the risk of SSI after posterior lumbar fusion surgery. Additionally, ML combined with SHAP could provide a clear interpretation of individualized risk prediction and give physicians an intuitive comprehension of the effects of the model's essential features.
期刊介绍:
World Neurosurgery has an open access mirror journal World Neurosurgery: X, sharing the same aims and scope, editorial team, submission system and rigorous peer review.
The journal''s mission is to:
-To provide a first-class international forum and a 2-way conduit for dialogue that is relevant to neurosurgeons and providers who care for neurosurgery patients. The categories of the exchanged information include clinical and basic science, as well as global information that provide social, political, educational, economic, cultural or societal insights and knowledge that are of significance and relevance to worldwide neurosurgery patient care.
-To act as a primary intellectual catalyst for the stimulation of creativity, the creation of new knowledge, and the enhancement of quality neurosurgical care worldwide.
-To provide a forum for communication that enriches the lives of all neurosurgeons and their colleagues; and, in so doing, enriches the lives of their patients.
Topics to be addressed in World Neurosurgery include: EDUCATION, ECONOMICS, RESEARCH, POLITICS, HISTORY, CULTURE, CLINICAL SCIENCE, LABORATORY SCIENCE, TECHNOLOGY, OPERATIVE TECHNIQUES, CLINICAL IMAGES, VIDEOS