AI-Based Measurement of Lumbar Spinal Stenosis on MRI: External Evaluation of a Fully Automated Model.

IF 7 1区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Investigative Radiology Pub Date : 2024-09-01 Epub Date: 2024-03-01 DOI:10.1097/RLI.0000000000001070

Sanja Bogdanovic, Matthias Staib, Marco Schleiniger, Livio Steiner, Leonardo Schwarz, Christoph Germann, Reto Sutter, Benjamin Fritz

{"title":"AI-Based Measurement of Lumbar Spinal Stenosis on MRI: External Evaluation of a Fully Automated Model.","authors":"Sanja Bogdanovic, Matthias Staib, Marco Schleiniger, Livio Steiner, Leonardo Schwarz, Christoph Germann, Reto Sutter, Benjamin Fritz","doi":"10.1097/RLI.0000000000001070","DOIUrl":null,"url":null,"abstract":"Objectives: The aim of this study was to clinically validate a fully automated AI model for magnetic resonance imaging (MRI)-based quantifications of lumbar spinal canal stenosis.Materials and methods: This retrospective study included lumbar spine MRI of 100 consecutive clinical patients (56 ± 17 years; 43 females, 57 males) performed on clinical 1.5 (51 examinations) and 3 T MRI scanners (49 examinations) with heterogeneous clinical imaging protocols. The AI model performed segmentations of the thecal sac on axial T2-weighted sequences. Based on these segmentations, the anteroposterior (AP) and mediolateral (ML) distance, and the area of the thecal sac were measured in a fully automated manner. For comparison, 2 fellowship-trained musculoskeletal radiologists performed the same segmentations and measurements independently. Statistics included 1-sample t tests, the intraclass correlation coefficient (ICC), Bland-Altman plots, and Dice coefficients. A P value of <0.05 was considered statistically significant.Results: The average measurements of the AI model, reader 1, and reader 2 were 194 ± 72 mm 2 , 181 ± 71 mm 2 , and 179 ± 70 mm 2 for thecal sac area, 13 ± 3.3 mm, 12.6 ± 3.3 mm, and 12.6 ± 3.2 mm for AP distance, and 19.5 ± 3.9 mm, 20 ± 4.3 mm, and 19.4 ± 4 mm for ML distance, respectively. Significant differences existed for all pairwise comparisons, besides reader 1 versus AI model for the ML distance and reader 1 versus reader 2 for the AP distance ( P = 0.1 and P = 0.21, respectively). The pairwise mean absolute errors among reader 1, reader 2, and the AI model ranged from 0.59 mm and 0.75 mm for the AP distance, from 1.16 mm to 1.37 mm for the ML distance, and from 7.9 mm 2 to 15.54 mm 2 for the thecal sac area. Pairwise ICCs among reader 1, reader 2, and the AI model ranged from 0.91 and 0.94 for the AP distance and from 0.86 to 0.9 for the ML distance without significant differences. For the thecal sac area, the pairwise ICC between both readers and the AI model of 0.97 each was slightly, but significantly lower than the ICC between reader 1 and reader 2 of 0.99. Similarly, the Dice coefficient and Hausdorff distance between both readers and the AI model were significantly lower than the values between reader 1 and reader 2, overall ranging from 0.93 to 0.95 for the Dice coefficients and 1.1 to 1.44 for the Hausdorff distances.Conclusions: The investigated AI model is reliable for assessing the AP and the ML thecal sac diameters with human level accuracies. The small differences for measurement and segmentation of the thecal sac area between the AI model and the radiologists are likely within a clinically acceptable range.","PeriodicalId":14486,"journal":{"name":"Investigative Radiology","volume":null,"pages":null},"PeriodicalIF":7.0000,"publicationDate":"2024-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Investigative Radiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/RLI.0000000000001070","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/3/1 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Objectives: The aim of this study was to clinically validate a fully automated AI model for magnetic resonance imaging (MRI)-based quantifications of lumbar spinal canal stenosis.

Materials and methods: This retrospective study included lumbar spine MRI of 100 consecutive clinical patients (56 ± 17 years; 43 females, 57 males) performed on clinical 1.5 (51 examinations) and 3 T MRI scanners (49 examinations) with heterogeneous clinical imaging protocols. The AI model performed segmentations of the thecal sac on axial T2-weighted sequences. Based on these segmentations, the anteroposterior (AP) and mediolateral (ML) distance, and the area of the thecal sac were measured in a fully automated manner. For comparison, 2 fellowship-trained musculoskeletal radiologists performed the same segmentations and measurements independently. Statistics included 1-sample t tests, the intraclass correlation coefficient (ICC), Bland-Altman plots, and Dice coefficients. A P value of <0.05 was considered statistically significant.

Results: The average measurements of the AI model, reader 1, and reader 2 were 194 ± 72 mm 2 , 181 ± 71 mm 2 , and 179 ± 70 mm 2 for thecal sac area, 13 ± 3.3 mm, 12.6 ± 3.3 mm, and 12.6 ± 3.2 mm for AP distance, and 19.5 ± 3.9 mm, 20 ± 4.3 mm, and 19.4 ± 4 mm for ML distance, respectively. Significant differences existed for all pairwise comparisons, besides reader 1 versus AI model for the ML distance and reader 1 versus reader 2 for the AP distance ( P = 0.1 and P = 0.21, respectively). The pairwise mean absolute errors among reader 1, reader 2, and the AI model ranged from 0.59 mm and 0.75 mm for the AP distance, from 1.16 mm to 1.37 mm for the ML distance, and from 7.9 mm 2 to 15.54 mm 2 for the thecal sac area. Pairwise ICCs among reader 1, reader 2, and the AI model ranged from 0.91 and 0.94 for the AP distance and from 0.86 to 0.9 for the ML distance without significant differences. For the thecal sac area, the pairwise ICC between both readers and the AI model of 0.97 each was slightly, but significantly lower than the ICC between reader 1 and reader 2 of 0.99. Similarly, the Dice coefficient and Hausdorff distance between both readers and the AI model were significantly lower than the values between reader 1 and reader 2, overall ranging from 0.93 to 0.95 for the Dice coefficients and 1.1 to 1.44 for the Hausdorff distances.

Conclusions: The investigated AI model is reliable for assessing the AP and the ML thecal sac diameters with human level accuracies. The small differences for measurement and segmentation of the thecal sac area between the AI model and the radiologists are likely within a clinically acceptable range.

查看原文本刊更多论文

基于人工智能的磁共振成像腰椎管狭窄测量：全自动模型的外部评估

研究目的本研究旨在对基于磁共振成像（MRI）量化腰椎管狭窄的全自动人工智能模型进行临床验证：这项回顾性研究包括在临床 1.5 T（51 次检查）和 3 T MRI 扫描仪（49 次检查）上对 100 名连续临床患者（56 ± 17 岁；43 名女性，57 名男性）进行的腰椎 MRI 检查，临床成像方案各不相同。人工智能模型在轴向 T2 加权序列上对椎管囊进行了分割。在这些分割的基础上，以全自动的方式测量了椎管内囊的前后（AP）距离、内外侧（ML）距离和面积。为了进行比较，两名受过研究培训的肌肉骨骼放射科医生独立完成了相同的分割和测量。统计数据包括单样本 t 检验、类内相关系数 (ICC)、Bland-Altman 图和 Dice 系数。结果的 P 值：AI 模型、阅读器 1 和阅读器 2 的平均测量结果分别为：膀胱囊面积为 194 ± 72 平方毫米、181 ± 71 平方毫米和 179 ± 70 平方毫米；AP 距离为 13 ± 3.3 毫米、12.6 ± 3.3 毫米和 12.6 ± 3.2 毫米；ML 距离为 19.5 ± 3.9 毫米、20 ± 4.3 毫米和 19.4 ± 4 毫米。除了阅读器 1 与人工智能模型的 ML 距离比较和阅读器 1 与阅读器 2 的 AP 距离比较存在显著差异外（P = 0.1 和 P = 0.21），所有成对比较均存在显著差异。阅读器 1、阅读器 2 和 AI 模型之间的成对平均绝对误差范围为：AP 距离为 0.59 毫米至 0.75 毫米，ML 距离为 1.16 毫米至 1.37 毫米，睾丸囊面积为 7.9 平方毫米至 15.54 平方毫米。阅读器 1、阅读器 2 和 AI 模型之间的配对 ICC 在 AP 距离上介于 0.91 和 0.94 之间，在 ML 距离上介于 0.86 和 0.9 之间，无显著差异。在钙囊面积方面，两个读者和人工智能模型之间的成对 ICC 均为 0.97，略低于读者 1 和读者 2 之间的 ICC 0.99，但差异显著。同样，两名读者与人工智能模型之间的狄斯系数和豪斯多夫距离也明显低于读者 1 和读者 2 之间的值，狄斯系数的总体范围为 0.93 至 0.95，豪斯多夫距离的总体范围为 1.1 至 1.44：所研究的人工智能模型可以可靠地评估 AP 和 ML 膀胱囊直径，其精确度达到了人类水平。人工智能模型与放射科医生在测量和分割椎囊面积方面的微小差异可能在临床可接受的范围内。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Investigative Radiology 医学-核医学

CiteScore

15.10

自引率

16.40%

发文量

188

审稿时长

4-8 weeks

期刊介绍： Investigative Radiology publishes original, peer-reviewed reports on clinical and laboratory investigations in diagnostic imaging, the diagnostic use of radioactive isotopes, computed tomography, positron emission tomography, magnetic resonance imaging, ultrasound, digital subtraction angiography, and related modalities. Emphasis is on early and timely publication. Primarily research-oriented, the journal also includes a wide variety of features of interest to clinical radiologists.