{"title":"Cobb angle prediction for adolescent idiopathic scoliosis via an explainable machine learning model","authors":"Yu Ding , Bin Li , Xiaoyong Guo","doi":"10.1016/j.array.2025.100455","DOIUrl":null,"url":null,"abstract":"<div><div>This study aims to build an accurate and interpretable machine learning model capable of adolescent idiopathic scoliosis prognostication. A tree-based gradient boosting machine is incorporated with a recently proposed Shapley-value-based explanation method-TreeExplainer. Anthropometric training data are collected from a public orthopedics clinic, and each instance is characterized by nine features with a prediction target. We adopt a transfer-learning strategy that takes advantage of the additive property of tree-based gradient boosting, allowing a gradient boosting machine regressor to be trained with limited labeled examples. Cross-validation estimation shows a satisfactory performance for predicting future spine curvature (Cobb angle). The root mean square error (<span><math><msup><mrow></mrow><mrow><mo>∘</mo></mrow></msup></math></span>), the mean absolute percentage error (<span><math><msup><mrow></mrow><mrow><mo>∘</mo></mrow></msup></math></span>), and the Pearson correlation coefficient are 3.69 ± 1.23, 2.81 ± 1.69, and 0.92 ± 0.01, respectively. Moreover, the overfitting has been largely removed, and the model may be generalized well to new patients. A well-trained model is taken as the input to the TreeExplainer. The output of the TreeExplainer provides us a richer understanding that demonstrates how a feature’s value impacts the model’s prediction for every instance. The patterns identified can substantially improve the human-artificial intelligence collaboration in the clinical management of patients with adolescent idiopathic scoliosis by preventing serious scoliosis progression and reducing healthcare costs.</div></div>","PeriodicalId":8417,"journal":{"name":"Array","volume":"27 ","pages":"Article 100455"},"PeriodicalIF":4.5000,"publicationDate":"2025-07-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Array","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590005625000827","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, THEORY & METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
This study aims to build an accurate and interpretable machine learning model capable of adolescent idiopathic scoliosis prognostication. A tree-based gradient boosting machine is incorporated with a recently proposed Shapley-value-based explanation method-TreeExplainer. Anthropometric training data are collected from a public orthopedics clinic, and each instance is characterized by nine features with a prediction target. We adopt a transfer-learning strategy that takes advantage of the additive property of tree-based gradient boosting, allowing a gradient boosting machine regressor to be trained with limited labeled examples. Cross-validation estimation shows a satisfactory performance for predicting future spine curvature (Cobb angle). The root mean square error (), the mean absolute percentage error (), and the Pearson correlation coefficient are 3.69 ± 1.23, 2.81 ± 1.69, and 0.92 ± 0.01, respectively. Moreover, the overfitting has been largely removed, and the model may be generalized well to new patients. A well-trained model is taken as the input to the TreeExplainer. The output of the TreeExplainer provides us a richer understanding that demonstrates how a feature’s value impacts the model’s prediction for every instance. The patterns identified can substantially improve the human-artificial intelligence collaboration in the clinical management of patients with adolescent idiopathic scoliosis by preventing serious scoliosis progression and reducing healthcare costs.