Caroline Constant, A Noelle Larson, David W Polly, Carl-Eric Aubin
{"title":"Neural network-based multi-task learning to assist planning of posterior spinal fusion surgery for adolescent idiopathic scoliosis.","authors":"Caroline Constant, A Noelle Larson, David W Polly, Carl-Eric Aubin","doi":"10.1007/s43390-025-01125-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Posterior spinal instrumentation and fusion (PSF) is the gold standard for severe adolescent idiopathic scoliosis (AIS), yet instrumentation strategies vary widely, often leading to suboptimal results. Deep learning's potential in AIS planning is underexplored.</p><p><strong>Methods: </strong>This study trained and validated an artificial neural network multi-task learning model (NNML) using preoperative clinical and radiographic data from 189 AIS patients with Lenke 1A and 2A curves enrolled in the MIMO Clinical Trial (NCT01792609). The model mimics experienced spine surgeons' decision-making for selecting the upper and the lower instrumented vertebrae (UIV, LIV), determining rod curvature, and predicting screw density based on the study's randomized allocation. Models were trained with data from 179 patients, utilizing tenfold cross-validation, and externally validated on 10 patients from a separate hospital and surgeons outside the training set. For UIV and LIV selection, accuracy within the top two predictions was used as a classification performance metric, ensuring that other clinically relevant alternatives were considered.</p><p><strong>Results: </strong>The NNML, which comprised 83 inputs and multiple hidden layers, led to significant gains over ST-NN and proved more robust during the internal validation (loss 6.2 vs. 9.3; p ≤ 0.01). It showed 82-95% and 80-100% accuracy for UIV and LIV predictions and 70-90% accuracy for predicting the rod curvatures ± 5°. The RMSE for the screw density and rod curvature predictions was 0.2-0.3 and 3.7-5.6°, respectively.</p><p><strong>Conclusion: </strong>An NNML can better use the features of relevant AIS patients for mixed task prediction pertinent to PSF surgery planning than ST-NN. In addition, NNML was capable of mimicking experienced spine surgeons' decision-making process when designing the instrumentation.</p>","PeriodicalId":21796,"journal":{"name":"Spine deformity","volume":" ","pages":""},"PeriodicalIF":1.8000,"publicationDate":"2025-06-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spine deformity","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s43390-025-01125-9","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: Posterior spinal instrumentation and fusion (PSF) is the gold standard for severe adolescent idiopathic scoliosis (AIS), yet instrumentation strategies vary widely, often leading to suboptimal results. Deep learning's potential in AIS planning is underexplored.
Methods: This study trained and validated an artificial neural network multi-task learning model (NNML) using preoperative clinical and radiographic data from 189 AIS patients with Lenke 1A and 2A curves enrolled in the MIMO Clinical Trial (NCT01792609). The model mimics experienced spine surgeons' decision-making for selecting the upper and the lower instrumented vertebrae (UIV, LIV), determining rod curvature, and predicting screw density based on the study's randomized allocation. Models were trained with data from 179 patients, utilizing tenfold cross-validation, and externally validated on 10 patients from a separate hospital and surgeons outside the training set. For UIV and LIV selection, accuracy within the top two predictions was used as a classification performance metric, ensuring that other clinically relevant alternatives were considered.
Results: The NNML, which comprised 83 inputs and multiple hidden layers, led to significant gains over ST-NN and proved more robust during the internal validation (loss 6.2 vs. 9.3; p ≤ 0.01). It showed 82-95% and 80-100% accuracy for UIV and LIV predictions and 70-90% accuracy for predicting the rod curvatures ± 5°. The RMSE for the screw density and rod curvature predictions was 0.2-0.3 and 3.7-5.6°, respectively.
Conclusion: An NNML can better use the features of relevant AIS patients for mixed task prediction pertinent to PSF surgery planning than ST-NN. In addition, NNML was capable of mimicking experienced spine surgeons' decision-making process when designing the instrumentation.
目的:后路脊柱内固定融合(PSF)是治疗严重青少年特发性脊柱侧凸(AIS)的金标准,但内固定策略差异很大,往往导致不理想的结果。深度学习在AIS规划中的潜力尚未得到充分开发。方法:本研究使用参加MIMO临床试验(NCT01792609)的189名患有Lenke 1A和2A曲线的AIS患者的术前临床和影像学数据,训练并验证了人工神经网络多任务学习模型(NNML)。该模型模拟了经验丰富的脊柱外科医生在选择上、下固定椎体(UIV、LIV)、确定棒曲率和基于研究随机分配预测螺钉密度方面的决策。模型使用来自179名患者的数据进行训练,利用十倍交叉验证,并对来自独立医院和训练集之外的外科医生的10名患者进行外部验证。对于UIV和LIV的选择,使用前两个预测的准确性作为分类性能指标,确保考虑其他临床相关的替代方案。结果:由83个输入和多个隐藏层组成的NNML比ST-NN获得了显著的增益,并且在内部验证中证明了更强的鲁棒性(损失6.2 vs 9.3;p≤0.01)。对uv和LIV的预测精度分别为82-95%和80-100%,对棒材曲率±5°的预测精度为70-90%。螺杆密度和杆曲率预测的RMSE分别为0.2-0.3°和3.7-5.6°。结论:与ST-NN相比,NNML能更好地利用相关AIS患者的特征进行与PSF手术计划相关的混合任务预测。此外,NNML在设计器械时能够模仿经验丰富的脊柱外科医生的决策过程。
期刊介绍:
Spine Deformity the official journal of the?Scoliosis Research Society is a peer-refereed publication to disseminate knowledge on basic science and clinical research into the?etiology?biomechanics?treatment?methods and outcomes of all types of?spinal deformities. The international members of the Editorial Board provide a worldwide perspective for the journal's area of interest.The?journal?will enhance the mission of the Society which is to foster the optimal care of all patients with?spine?deformities worldwide. Articles published in?Spine Deformity?are Medline indexed in PubMed.? The journal publishes original articles in the form of clinical and basic research. Spine Deformity will only publish studies that have institutional review board (IRB) or similar ethics committee approval for human and animal studies and have strictly observed these guidelines. The minimum follow-up period for follow-up clinical studies is 24 months.