{"title":"Deep learning for automated grading of radiographic sacroiliitis.","authors":"Xinyi Meng, Yongku Du, Rongrong Jia, Qing Zhou, Yuwei Xia, Feng Shi, Fanhui Zhao, Yanjun Gao","doi":"10.21037/qims-2024-2742","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Grading assessment of sacroiliitis via X-ray is widely used in clinical evaluation. The aim of this study was to develop and validate an artificial intelligence (AI) system to help physicians in assessing and diagnosing sacroiliitis from standard X-ray images.</p><p><strong>Methods: </strong>In this retrospective study, a deep learning model for the automated grading assessment of radiographic sacroiliitis was developed using pelvic X-ray images from a training set of 465 individuals (930 single sacroiliac joints) and a validation set of 195 individuals (390 single sacroiliac joints). The algorithm was tested using an external test set of 223 individuals (446 single sacroiliac joints). The receiver operating characteristic (ROC) curve was used to calculate the area under the curve (AUC), sensitivity, and specificity to assess the model's performance. The findings of the model were used as a reference to determine its utility in aiding radiologists in the diagnosis and grading assessment of sacroiliitis.</p><p><strong>Results: </strong>The neural network model demonstrated proficiency in assessing grading of sacroiliitis. In the external test set, the model achieved a grading accuracy rate of 63.90% for radiographic sacroiliitis, and its diagnostic accuracy for determining the presence of radiographic sacroiliitis reached 90.13%. With the assistance of the model, the diagnostic accuracy of radiological sacroiliac arthritis by two junior imaging physicians improved significantly, increasing from 92.45% and 91.10% to 97.17% and 95.29%, respectively. Furthermore, the accuracy of image grading (grades 0 to 4) also showed notable improvement, rising from 75.00% and 74.08% to 88.89% and 80.90%, respectively.</p><p><strong>Conclusions: </strong>The AI model demonstrated high diagnostic accuracy and can greatly enhance the precision of radiographic sacroiliitis grading.</p>","PeriodicalId":54267,"journal":{"name":"Quantitative Imaging in Medicine and Surgery","volume":"15 6","pages":"5137-5150"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12209610/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Quantitative Imaging in Medicine and Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/qims-2024-2742","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/3 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Grading assessment of sacroiliitis via X-ray is widely used in clinical evaluation. The aim of this study was to develop and validate an artificial intelligence (AI) system to help physicians in assessing and diagnosing sacroiliitis from standard X-ray images.
Methods: In this retrospective study, a deep learning model for the automated grading assessment of radiographic sacroiliitis was developed using pelvic X-ray images from a training set of 465 individuals (930 single sacroiliac joints) and a validation set of 195 individuals (390 single sacroiliac joints). The algorithm was tested using an external test set of 223 individuals (446 single sacroiliac joints). The receiver operating characteristic (ROC) curve was used to calculate the area under the curve (AUC), sensitivity, and specificity to assess the model's performance. The findings of the model were used as a reference to determine its utility in aiding radiologists in the diagnosis and grading assessment of sacroiliitis.
Results: The neural network model demonstrated proficiency in assessing grading of sacroiliitis. In the external test set, the model achieved a grading accuracy rate of 63.90% for radiographic sacroiliitis, and its diagnostic accuracy for determining the presence of radiographic sacroiliitis reached 90.13%. With the assistance of the model, the diagnostic accuracy of radiological sacroiliac arthritis by two junior imaging physicians improved significantly, increasing from 92.45% and 91.10% to 97.17% and 95.29%, respectively. Furthermore, the accuracy of image grading (grades 0 to 4) also showed notable improvement, rising from 75.00% and 74.08% to 88.89% and 80.90%, respectively.
Conclusions: The AI model demonstrated high diagnostic accuracy and can greatly enhance the precision of radiographic sacroiliitis grading.