The Classification of Lumbar Spondylolisthesis X-Ray Images Using Convolutional Neural Networks

IF 3.8 2区工程技术 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Journal of Digital Imaging Pub Date : 2024-04-18 DOI:10.1007/s10278-024-01115-9

Wutong Chen, Du Junsheng, Yanzhen Chen, Yifeng Fan, Hengzhi Liu, Chang Tan, Xuanming Shao, Xinzhi Li

{"title":"The Classification of Lumbar Spondylolisthesis X-Ray Images Using Convolutional Neural Networks","authors":"Wutong Chen, Du Junsheng, Yanzhen Chen, Yifeng Fan, Hengzhi Liu, Chang Tan, Xuanming Shao, Xinzhi Li","doi":"10.1007/s10278-024-01115-9","DOIUrl":null,"url":null,"abstract":"We aimed to develop and validate a deep convolutional neural network (DCNN) model capable of accurately identifying spondylolysis or spondylolisthesis on lateral or dynamic X-ray images. A total of 2449 lumbar lateral and dynamic X-ray images were collected from two tertiary hospitals. These images were categorized into lumbar spondylolysis (LS), degenerative lumbar spondylolisthesis (DLS), and normal lumbar in a proportional manner. Subsequently, the images were randomly divided into training, validation, and test sets to establish a classification recognition network. The model training and validation process utilized the EfficientNetV2-M network. The model’s ability to generalize was assessed by conducting a rigorous evaluation on an entirely independent test set and comparing its performance with the diagnoses made by three orthopedists and three radiologists. The evaluation metrics employed to assess the model’s performance included accuracy, sensitivity, specificity, and F1 score. Additionally, the weight distribution of the network was visualized using gradient-weighted class activation mapping (Grad-CAM). For the doctor group, accuracy ranged from 87.9 to 90.0% (mean, 89.0%), precision ranged from 87.2 to 90.5% (mean, 89.0%), sensitivity ranged from 87.1 to 91.0% (mean, 89.2%), specificity ranged from 93.7 to 94.7% (mean, 94.3%), and F1 score ranged from 88.2 to 89.9% (mean, 89.1%). The DCNN model had accuracy of 92.0%, precision of 91.9%, sensitivity of 92.2%, specificity of 95.7%, and F1 score of 92.0%. Grad-CAM exhibited concentrations of highlighted areas in the intervertebral foraminal region. We developed a DCNN model that intelligently distinguished spondylolysis or spondylolisthesis on lumbar lateral or lumbar dynamic radiographs.","PeriodicalId":50214,"journal":{"name":"Journal of Digital Imaging","volume":"50 1","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Digital Imaging","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1007/s10278-024-01115-9","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

We aimed to develop and validate a deep convolutional neural network (DCNN) model capable of accurately identifying spondylolysis or spondylolisthesis on lateral or dynamic X-ray images. A total of 2449 lumbar lateral and dynamic X-ray images were collected from two tertiary hospitals. These images were categorized into lumbar spondylolysis (LS), degenerative lumbar spondylolisthesis (DLS), and normal lumbar in a proportional manner. Subsequently, the images were randomly divided into training, validation, and test sets to establish a classification recognition network. The model training and validation process utilized the EfficientNetV2-M network. The model’s ability to generalize was assessed by conducting a rigorous evaluation on an entirely independent test set and comparing its performance with the diagnoses made by three orthopedists and three radiologists. The evaluation metrics employed to assess the model’s performance included accuracy, sensitivity, specificity, and F1 score. Additionally, the weight distribution of the network was visualized using gradient-weighted class activation mapping (Grad-CAM). For the doctor group, accuracy ranged from 87.9 to 90.0% (mean, 89.0%), precision ranged from 87.2 to 90.5% (mean, 89.0%), sensitivity ranged from 87.1 to 91.0% (mean, 89.2%), specificity ranged from 93.7 to 94.7% (mean, 94.3%), and F1 score ranged from 88.2 to 89.9% (mean, 89.1%). The DCNN model had accuracy of 92.0%, precision of 91.9%, sensitivity of 92.2%, specificity of 95.7%, and F1 score of 92.0%. Grad-CAM exhibited concentrations of highlighted areas in the intervertebral foraminal region. We developed a DCNN model that intelligently distinguished spondylolysis or spondylolisthesis on lumbar lateral or lumbar dynamic radiographs.

Abstract Image

查看原文本刊更多论文

利用卷积神经网络对腰椎骨质增生 X 射线图像进行分类

我们旨在开发并验证一种深度卷积神经网络（DCNN）模型，该模型能够在侧位或动态 X 光图像上准确识别脊柱溶解症或脊柱滑脱症。我们从两家三甲医院共收集了 2449 张腰椎侧位和动态 X 光图像。这些图像按比例分为腰椎溶解症（LS）、退行性腰椎滑脱症（DLS）和正常腰椎。随后，将图像随机分为训练集、验证集和测试集，以建立分类识别网络。模型的训练和验证过程使用了 EfficientNetV2-M 网络。通过在完全独立的测试集上进行严格评估，并将其性能与三位骨科医生和三位放射科医生的诊断结果进行比较，评估了该模型的通用能力。用于评估模型性能的评价指标包括准确性、灵敏度、特异性和 F1 分数。此外，还使用梯度加权类激活图谱（Grad-CAM）对网络的权重分布进行了可视化。医生组的准确率为 87.9% 至 90.0%（平均值为 89.0%），精确度为 87.2% 至 90.5%（平均值为 89.0%），灵敏度为 87.1% 至 91.0%（平均值为 89.2%），特异性为 93.7% 至 94.7%（平均值为 94.3%），F1 分数为 88.2% 至 89.9%（平均值为 89.1%）。DCNN 模型的准确度为 92.0%，精确度为 91.9%，灵敏度为 92.2%，特异性为 95.7%，F1 分数为 92.0%。Grad-CAM 显示了椎间孔区域的高亮区域集中。我们开发了一种 DCNN 模型，可智能区分腰椎侧位片或腰椎动态X光片上的脊柱溶解症或脊柱滑脱症。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Digital Imaging 医学-核医学

CiteScore

7.50

自引率

6.80%

发文量

192

审稿时长

6-12 weeks

期刊介绍： The Journal of Digital Imaging (JDI) is the official peer-reviewed journal of the Society for Imaging Informatics in Medicine (SIIM). JDI’s goal is to enhance the exchange of knowledge encompassed by the general topic of Imaging Informatics in Medicine such as research and practice in clinical, engineering, and information technologies and techniques in all medical imaging environments. JDI topics are of interest to researchers, developers, educators, physicians, and imaging informatics professionals. Suggested Topics PACS and component systems; imaging informatics for the enterprise; image-enabled electronic medical records; RIS and HIS; digital image acquisition; image processing; image data compression; 3D, visualization, and multimedia; speech recognition; computer-aided diagnosis; facilities design; imaging vocabularies and ontologies; Transforming the Radiological Interpretation Process (TRIP™); DICOM and other standards; workflow and process modeling and simulation; quality assurance; archive integrity and security; teleradiology; digital mammography; and radiological informatics education.