Gianluca Caprili MD , Andrea G. Calamita MD , Michele Novi MD , Domenico A. Campanacci MD, PhD , Simone Nicoletti MD
{"title":"评估机器学习预测两种不同肩假体的准确性:外部验证研究","authors":"Gianluca Caprili MD , Andrea G. Calamita MD , Michele Novi MD , Domenico A. Campanacci MD, PhD , Simone Nicoletti MD","doi":"10.1016/j.jseint.2025.04.024","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>The integration of machine learning in orthopedic surgery, including shoulder procedures, has garnered increasing interest. This retrospective analysis aims to externally validate a predictive analytics platform within a patient cohort undergoing reverse total shoulder arthroplasty for eccentric or concentric glenohumeral osteoarthritis.</div></div><div><h3>Methods</h3><div>Ninety patients who underwent reverse total shoulder arthroplasty at our institution in 2022 were selected for this study. Patients were divided into 2 groups based on the type of implant (50 Exactech Equinoxe and 40 Zimmer Biomet Comprehensive). Preoperative evaluations included 19 variables per the tool requirements (demographics, diagnosis, comorbidities, patient-reported pain and function, and range of motion). The study compared the tool's outcome predictions with postoperative outcomes at 3-6 months, 1 year, and 2 years postsurgery for visual analog scale and active range of movement. We also quantified the mean absolute error (MAE) separately in the 2 groups and compared it to the MAE from the internal validation of the same tool.</div></div><div><h3>Results</h3><div>Significant improvements in visual analog scale, active forward elevation, active abduction, and active external rotation met the minimal clinically important difference at 3-6 months, 1 year, and 2 years in both implant groups. Additionally, a modest improvement in active internal rotation was observed in both cohorts. In terms of MAE, we found a higher error than the internal validation only for forward elevation at 3-6 months in group 2 and a lower error in all the other outcome measures at all time points for both groups.</div></div><div><h3>Conclusion</h3><div>The predictive analytics platform demonstrated a lower or similar MAE than the internal validation for both groups. Notably, we found that the tool's predictions are generalizable to another shoulder prosthesis, even though it was not trained on that particular product. This tool holds promise for aiding clinicians in managing patient expectations.</div></div>","PeriodicalId":34444,"journal":{"name":"JSES International","volume":"9 4","pages":"Pages 1352-1356"},"PeriodicalIF":0.0000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Assessing the accuracy of a machine learning prediction for 2 different shoulder prostheses: an external validation study\",\"authors\":\"Gianluca Caprili MD , Andrea G. Calamita MD , Michele Novi MD , Domenico A. Campanacci MD, PhD , Simone Nicoletti MD\",\"doi\":\"10.1016/j.jseint.2025.04.024\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>The integration of machine learning in orthopedic surgery, including shoulder procedures, has garnered increasing interest. This retrospective analysis aims to externally validate a predictive analytics platform within a patient cohort undergoing reverse total shoulder arthroplasty for eccentric or concentric glenohumeral osteoarthritis.</div></div><div><h3>Methods</h3><div>Ninety patients who underwent reverse total shoulder arthroplasty at our institution in 2022 were selected for this study. Patients were divided into 2 groups based on the type of implant (50 Exactech Equinoxe and 40 Zimmer Biomet Comprehensive). Preoperative evaluations included 19 variables per the tool requirements (demographics, diagnosis, comorbidities, patient-reported pain and function, and range of motion). The study compared the tool's outcome predictions with postoperative outcomes at 3-6 months, 1 year, and 2 years postsurgery for visual analog scale and active range of movement. We also quantified the mean absolute error (MAE) separately in the 2 groups and compared it to the MAE from the internal validation of the same tool.</div></div><div><h3>Results</h3><div>Significant improvements in visual analog scale, active forward elevation, active abduction, and active external rotation met the minimal clinically important difference at 3-6 months, 1 year, and 2 years in both implant groups. Additionally, a modest improvement in active internal rotation was observed in both cohorts. In terms of MAE, we found a higher error than the internal validation only for forward elevation at 3-6 months in group 2 and a lower error in all the other outcome measures at all time points for both groups.</div></div><div><h3>Conclusion</h3><div>The predictive analytics platform demonstrated a lower or similar MAE than the internal validation for both groups. Notably, we found that the tool's predictions are generalizable to another shoulder prosthesis, even though it was not trained on that particular product. This tool holds promise for aiding clinicians in managing patient expectations.</div></div>\",\"PeriodicalId\":34444,\"journal\":{\"name\":\"JSES International\",\"volume\":\"9 4\",\"pages\":\"Pages 1352-1356\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JSES International\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666638325001495\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JSES International","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666638325001495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Medicine","Score":null,"Total":0}
Assessing the accuracy of a machine learning prediction for 2 different shoulder prostheses: an external validation study
Background
The integration of machine learning in orthopedic surgery, including shoulder procedures, has garnered increasing interest. This retrospective analysis aims to externally validate a predictive analytics platform within a patient cohort undergoing reverse total shoulder arthroplasty for eccentric or concentric glenohumeral osteoarthritis.
Methods
Ninety patients who underwent reverse total shoulder arthroplasty at our institution in 2022 were selected for this study. Patients were divided into 2 groups based on the type of implant (50 Exactech Equinoxe and 40 Zimmer Biomet Comprehensive). Preoperative evaluations included 19 variables per the tool requirements (demographics, diagnosis, comorbidities, patient-reported pain and function, and range of motion). The study compared the tool's outcome predictions with postoperative outcomes at 3-6 months, 1 year, and 2 years postsurgery for visual analog scale and active range of movement. We also quantified the mean absolute error (MAE) separately in the 2 groups and compared it to the MAE from the internal validation of the same tool.
Results
Significant improvements in visual analog scale, active forward elevation, active abduction, and active external rotation met the minimal clinically important difference at 3-6 months, 1 year, and 2 years in both implant groups. Additionally, a modest improvement in active internal rotation was observed in both cohorts. In terms of MAE, we found a higher error than the internal validation only for forward elevation at 3-6 months in group 2 and a lower error in all the other outcome measures at all time points for both groups.
Conclusion
The predictive analytics platform demonstrated a lower or similar MAE than the internal validation for both groups. Notably, we found that the tool's predictions are generalizable to another shoulder prosthesis, even though it was not trained on that particular product. This tool holds promise for aiding clinicians in managing patient expectations.