External validation of a prediction model for disability and pain after lumbar disc herniation surgery: a prospective international registry-based cohort study.
Allan Abbott, Casper Friis Pedersen, Henrik Hedevik, Catharina Parai, Martin A Gorosito, Mikkel Andersen, Tor Ingebrigtsen, Tore K Solberg, Margreth Grotle, Bjørnar Berg
{"title":"External validation of a prediction model for disability and pain after lumbar disc herniation surgery: a prospective international registry-based cohort study.","authors":"Allan Abbott, Casper Friis Pedersen, Henrik Hedevik, Catharina Parai, Martin A Gorosito, Mikkel Andersen, Tor Ingebrigtsen, Tore K Solberg, Margreth Grotle, Bjørnar Berg","doi":"10.2340/17453674.2025.44251","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and purpose: </strong> We aimed to externally validate machine learning models developed in Norway by evaluating their predictive outcome of disability and pain 12 months after lumbar disc herniation surgery in a Swedish and Danish cohort.</p><p><strong>Methods: </strong> Data was extracted for patients undergoing microdiscectomy or open discectomy for lumbar disc herniation in the NORspine, SweSpine and DaneSpine national registries. Outcome of interest was changes in Oswestry disability index (ODI) (≥ 22 points), Numeric Rating Scale (NRS) for back pain (≥ 2 points), and NRS for leg pain (≥ 4 points). Model performance was evaluated by discrimination (C-statistic), calibration, overall fit, and net benefit.</p><p><strong>Results: </strong> For the ODI model, the NORspine cohort included 22,529 patients, the SweSpine cohort included 10,129 patients, and DaneSpine 5,670 patients. The ODI model's C-statistic varied between 0.76 and 0.81 and calibration slope point estimates varied between 0.84 and 0.99. The C-statistic for NRS back pain varied between 0.70 and 0.76, and calibration slopes varied between 0.79 and 1.03. The C-statistic for NRS leg pain varied between 0.71 and 0.74, and calibration slopes varied between 0.90 and 1.02. There was acceptable overall fit and calibration metrics with minor-modest but explainable heterogeneity observed in the calibration plots. Decision curve analyses displayed clear potential net benefit in treatment in accordance with the prediction models compared with treating all patients or none.</p><p><strong>Conclusion: </strong> Predictive performance of machine learning models for treatment success/non-success in disability and pain at 12 months post-surgery for lumbar disc herniation showed acceptable discrimination ability, calibration, overall fit, and net benefit reproducible in similar international contexts. Future clinical impact studies are required.</p>","PeriodicalId":6916,"journal":{"name":"Acta Orthopaedica","volume":"96 ","pages":"512-520"},"PeriodicalIF":2.5000,"publicationDate":"2025-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12232440/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Acta Orthopaedica","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2340/17453674.2025.44251","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background and purpose: We aimed to externally validate machine learning models developed in Norway by evaluating their predictive outcome of disability and pain 12 months after lumbar disc herniation surgery in a Swedish and Danish cohort.
Methods: Data was extracted for patients undergoing microdiscectomy or open discectomy for lumbar disc herniation in the NORspine, SweSpine and DaneSpine national registries. Outcome of interest was changes in Oswestry disability index (ODI) (≥ 22 points), Numeric Rating Scale (NRS) for back pain (≥ 2 points), and NRS for leg pain (≥ 4 points). Model performance was evaluated by discrimination (C-statistic), calibration, overall fit, and net benefit.
Results: For the ODI model, the NORspine cohort included 22,529 patients, the SweSpine cohort included 10,129 patients, and DaneSpine 5,670 patients. The ODI model's C-statistic varied between 0.76 and 0.81 and calibration slope point estimates varied between 0.84 and 0.99. The C-statistic for NRS back pain varied between 0.70 and 0.76, and calibration slopes varied between 0.79 and 1.03. The C-statistic for NRS leg pain varied between 0.71 and 0.74, and calibration slopes varied between 0.90 and 1.02. There was acceptable overall fit and calibration metrics with minor-modest but explainable heterogeneity observed in the calibration plots. Decision curve analyses displayed clear potential net benefit in treatment in accordance with the prediction models compared with treating all patients or none.
Conclusion: Predictive performance of machine learning models for treatment success/non-success in disability and pain at 12 months post-surgery for lumbar disc herniation showed acceptable discrimination ability, calibration, overall fit, and net benefit reproducible in similar international contexts. Future clinical impact studies are required.
期刊介绍:
Acta Orthopaedica (previously Acta Orthopaedica Scandinavica) presents original articles of basic research interest, as well as clinical studies in the field of orthopedics and related sub disciplines. Ever since the journal was founded in 1930, by a group of Scandinavian orthopedic surgeons, the journal has been published for an international audience. Acta Orthopaedica is owned by the Nordic Orthopaedic Federation and is the official publication of this federation.