Wutong Chen, Du Junsheng, Yanzhen Chen, Yifeng Fan, Hengzhi Liu, Chang Tan, Xuanming Shao, Xinzhi Li
{"title":"The Classification of Lumbar Spondylolisthesis X-Ray Images Using Convolutional Neural Networks","authors":"Wutong Chen, Du Junsheng, Yanzhen Chen, Yifeng Fan, Hengzhi Liu, Chang Tan, Xuanming Shao, Xinzhi Li","doi":"10.1007/s10278-024-01115-9","DOIUrl":null,"url":null,"abstract":"<p>We aimed to develop and validate a deep convolutional neural network (DCNN) model capable of accurately identifying spondylolysis or spondylolisthesis on lateral or dynamic X-ray images. A total of 2449 lumbar lateral and dynamic X-ray images were collected from two tertiary hospitals. These images were categorized into lumbar spondylolysis (LS), degenerative lumbar spondylolisthesis (DLS), and normal lumbar in a proportional manner. Subsequently, the images were randomly divided into training, validation, and test sets to establish a classification recognition network. The model training and validation process utilized the EfficientNetV2-M network. The model’s ability to generalize was assessed by conducting a rigorous evaluation on an entirely independent test set and comparing its performance with the diagnoses made by three orthopedists and three radiologists. The evaluation metrics employed to assess the model’s performance included accuracy, sensitivity, specificity, and <i>F</i>1 score. Additionally, the weight distribution of the network was visualized using gradient-weighted class activation mapping (Grad-CAM). For the doctor group, accuracy ranged from 87.9 to 90.0% (mean, 89.0%), precision ranged from 87.2 to 90.5% (mean, 89.0%), sensitivity ranged from 87.1 to 91.0% (mean, 89.2%), specificity ranged from 93.7 to 94.7% (mean, 94.3%), and <i>F</i>1 score ranged from 88.2 to 89.9% (mean, 89.1%). The DCNN model had accuracy of 92.0%, precision of 91.9%, sensitivity of 92.2%, specificity of 95.7%, and <i>F</i>1 score of 92.0%. Grad-CAM exhibited concentrations of highlighted areas in the intervertebral foraminal region. We developed a DCNN model that intelligently distinguished spondylolysis or spondylolisthesis on lumbar lateral or lumbar dynamic radiographs.</p>","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"50 1","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Digital Imaging","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s10278-024-01115-9","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}
引用次数: 0
Abstract
We aimed to develop and validate a deep convolutional neural network (DCNN) model capable of accurately identifying spondylolysis or spondylolisthesis on lateral or dynamic X-ray images. A total of 2449 lumbar lateral and dynamic X-ray images were collected from two tertiary hospitals. These images were categorized into lumbar spondylolysis (LS), degenerative lumbar spondylolisthesis (DLS), and normal lumbar in a proportional manner. Subsequently, the images were randomly divided into training, validation, and test sets to establish a classification recognition network. The model training and validation process utilized the EfficientNetV2-M network. The model’s ability to generalize was assessed by conducting a rigorous evaluation on an entirely independent test set and comparing its performance with the diagnoses made by three orthopedists and three radiologists. The evaluation metrics employed to assess the model’s performance included accuracy, sensitivity, specificity, and F1 score. Additionally, the weight distribution of the network was visualized using gradient-weighted class activation mapping (Grad-CAM). For the doctor group, accuracy ranged from 87.9 to 90.0% (mean, 89.0%), precision ranged from 87.2 to 90.5% (mean, 89.0%), sensitivity ranged from 87.1 to 91.0% (mean, 89.2%), specificity ranged from 93.7 to 94.7% (mean, 94.3%), and F1 score ranged from 88.2 to 89.9% (mean, 89.1%). The DCNN model had accuracy of 92.0%, precision of 91.9%, sensitivity of 92.2%, specificity of 95.7%, and F1 score of 92.0%. Grad-CAM exhibited concentrations of highlighted areas in the intervertebral foraminal region. We developed a DCNN model that intelligently distinguished spondylolysis or spondylolisthesis on lumbar lateral or lumbar dynamic radiographs.
期刊介绍:
The Journal of Digital Imaging (JDI) is the official peer-reviewed journal of the Society for Imaging Informatics in Medicine (SIIM). JDI’s goal is to enhance the exchange of knowledge encompassed by the general topic of Imaging Informatics in Medicine such as research and practice in clinical, engineering, and information technologies and techniques in all medical imaging environments. JDI topics are of interest to researchers, developers, educators, physicians, and imaging informatics professionals.
Suggested Topics
PACS and component systems; imaging informatics for the enterprise; image-enabled electronic medical records; RIS and HIS; digital image acquisition; image processing; image data compression; 3D, visualization, and multimedia; speech recognition; computer-aided diagnosis; facilities design; imaging vocabularies and ontologies; Transforming the Radiological Interpretation Process (TRIP™); DICOM and other standards; workflow and process modeling and simulation; quality assurance; archive integrity and security; teleradiology; digital mammography; and radiological informatics education.