Ziliang Cheng , Jingjing Li , Weishan Wu , Jiguang Yin , Xiangpeng Wang
{"title":"开发和验证机器学习模型来预测全膝关节置换术后的慢性疼痛","authors":"Ziliang Cheng , Jingjing Li , Weishan Wu , Jiguang Yin , Xiangpeng Wang","doi":"10.1016/j.knee.2025.05.011","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This study aims to compare the performance of various machine learning algorithms in predicting chronic pain after total knee arthroplasty (CPSP).</div></div><div><h3>Methods</h3><div>Patients with CPSP after total knee arthroplasty at the same medical center between January 1, 2021, and January 1, 2023, were selected for this study. A retrospective cohort design was employed to collect samples, which were then randomly divided into a training set and a test set in a 7:3 ratio. Valid high-risk factors were identified using the Least Absolute Shrinkage and Selection Operator (LASSO) method. Subsequently, five predictive models were constructed and evaluated based on machine learning (ML) algorithms, including Decision Tree (DT), Light Gradient Boosting Machine (LGBM), Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). In the test dataset, the model’s performance was evaluated using metrics including accuracy, precision, recall, specificity, F1-score, Brier score, and area under the curve (AUC). The Brier score helped identify the most suitable model, and SHAP values were explained to analyze the key factors affecting the predictions.</div></div><div><h3>Results</h3><div>This study enrolled 785 patients who underwent total knee arthroplasty, with 549 in the training − set and 236 in the test − set. The overall CPSP incidence was 39.6%. Nine high − risk factors were identified: hospital stay length, albumin levels, acute postoperative pain status (APSP), non-operative pain status, pain catastrophizing, osteoporosis, preoperative operative-area pain score, education level, and rehabilitation site. The AUC values were: DT(0.877), LGBM(0.914), SVM(0.890), RF(0.918), and XGBoost(0.898). The Brier scores were: DT (0.123), LGBM (0.119), SVM (0.126), RF (0.111), and XGBoost (0.124). These findings suggest that the RF model had the best performance.</div></div><div><h3>Conclusion</h3><div>The incidence of CPSP in TKA patients is high, which has a significant adverse effect on body function and needs to be paid attention to. Nine risk factors have been identified. RF model can effectively identify CPSP patients, which is helpful for clinical medical staff to early identify and intervene in high-risk patients.</div></div>","PeriodicalId":56110,"journal":{"name":"Knee","volume":"56 ","pages":"Pages 52-65"},"PeriodicalIF":1.6000,"publicationDate":"2025-05-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Developing and validating a machine learning model to predict chronic pain following total knee arthroplasty\",\"authors\":\"Ziliang Cheng , Jingjing Li , Weishan Wu , Jiguang Yin , Xiangpeng Wang\",\"doi\":\"10.1016/j.knee.2025.05.011\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Objective</h3><div>This study aims to compare the performance of various machine learning algorithms in predicting chronic pain after total knee arthroplasty (CPSP).</div></div><div><h3>Methods</h3><div>Patients with CPSP after total knee arthroplasty at the same medical center between January 1, 2021, and January 1, 2023, were selected for this study. A retrospective cohort design was employed to collect samples, which were then randomly divided into a training set and a test set in a 7:3 ratio. Valid high-risk factors were identified using the Least Absolute Shrinkage and Selection Operator (LASSO) method. Subsequently, five predictive models were constructed and evaluated based on machine learning (ML) algorithms, including Decision Tree (DT), Light Gradient Boosting Machine (LGBM), Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). In the test dataset, the model’s performance was evaluated using metrics including accuracy, precision, recall, specificity, F1-score, Brier score, and area under the curve (AUC). The Brier score helped identify the most suitable model, and SHAP values were explained to analyze the key factors affecting the predictions.</div></div><div><h3>Results</h3><div>This study enrolled 785 patients who underwent total knee arthroplasty, with 549 in the training − set and 236 in the test − set. The overall CPSP incidence was 39.6%. Nine high − risk factors were identified: hospital stay length, albumin levels, acute postoperative pain status (APSP), non-operative pain status, pain catastrophizing, osteoporosis, preoperative operative-area pain score, education level, and rehabilitation site. The AUC values were: DT(0.877), LGBM(0.914), SVM(0.890), RF(0.918), and XGBoost(0.898). The Brier scores were: DT (0.123), LGBM (0.119), SVM (0.126), RF (0.111), and XGBoost (0.124). These findings suggest that the RF model had the best performance.</div></div><div><h3>Conclusion</h3><div>The incidence of CPSP in TKA patients is high, which has a significant adverse effect on body function and needs to be paid attention to. Nine risk factors have been identified. RF model can effectively identify CPSP patients, which is helpful for clinical medical staff to early identify and intervene in high-risk patients.</div></div>\",\"PeriodicalId\":56110,\"journal\":{\"name\":\"Knee\",\"volume\":\"56 \",\"pages\":\"Pages 52-65\"},\"PeriodicalIF\":1.6000,\"publicationDate\":\"2025-05-21\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knee\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0968016025001176\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ORTHOPEDICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knee","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0968016025001176","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
Developing and validating a machine learning model to predict chronic pain following total knee arthroplasty
Objective
This study aims to compare the performance of various machine learning algorithms in predicting chronic pain after total knee arthroplasty (CPSP).
Methods
Patients with CPSP after total knee arthroplasty at the same medical center between January 1, 2021, and January 1, 2023, were selected for this study. A retrospective cohort design was employed to collect samples, which were then randomly divided into a training set and a test set in a 7:3 ratio. Valid high-risk factors were identified using the Least Absolute Shrinkage and Selection Operator (LASSO) method. Subsequently, five predictive models were constructed and evaluated based on machine learning (ML) algorithms, including Decision Tree (DT), Light Gradient Boosting Machine (LGBM), Support Vector Machine (SVM), Random Forest (RF), and Extreme Gradient Boosting (XGBoost). In the test dataset, the model’s performance was evaluated using metrics including accuracy, precision, recall, specificity, F1-score, Brier score, and area under the curve (AUC). The Brier score helped identify the most suitable model, and SHAP values were explained to analyze the key factors affecting the predictions.
Results
This study enrolled 785 patients who underwent total knee arthroplasty, with 549 in the training − set and 236 in the test − set. The overall CPSP incidence was 39.6%. Nine high − risk factors were identified: hospital stay length, albumin levels, acute postoperative pain status (APSP), non-operative pain status, pain catastrophizing, osteoporosis, preoperative operative-area pain score, education level, and rehabilitation site. The AUC values were: DT(0.877), LGBM(0.914), SVM(0.890), RF(0.918), and XGBoost(0.898). The Brier scores were: DT (0.123), LGBM (0.119), SVM (0.126), RF (0.111), and XGBoost (0.124). These findings suggest that the RF model had the best performance.
Conclusion
The incidence of CPSP in TKA patients is high, which has a significant adverse effect on body function and needs to be paid attention to. Nine risk factors have been identified. RF model can effectively identify CPSP patients, which is helpful for clinical medical staff to early identify and intervene in high-risk patients.
期刊介绍:
The Knee is an international journal publishing studies on the clinical treatment and fundamental biomechanical characteristics of this joint. The aim of the journal is to provide a vehicle relevant to surgeons, biomedical engineers, imaging specialists, materials scientists, rehabilitation personnel and all those with an interest in the knee.
The topics covered include, but are not limited to:
• Anatomy, physiology, morphology and biochemistry;
• Biomechanical studies;
• Advances in the development of prosthetic, orthotic and augmentation devices;
• Imaging and diagnostic techniques;
• Pathology;
• Trauma;
• Surgery;
• Rehabilitation.