Using deep learning to enhance reporting efficiency and accuracy in degenerative cervical spine MRI.

IF 4.9 1区医学 Q1 CLINICAL NEUROLOGY

Spine Journal Pub Date : 2025-03-26 DOI:10.1016/j.spinee.2025.03.009

Aric Lee, Junran Wu, Changshuo Liu, Andrew Makmur, Yong Han Ting, Shannon Lee, Matthew Ding Zhou Chan, Desmond Shi Wei Lim, Vanessa Mei Hui Khoo, Jonathan Sng, Han Yang Ong, Amos Tan, Shuliang Ge, Faimee Erwan Muhamat Nor, Yi Ting Lim, Joey Chan Yiing Beh, Qai Ven Yap, Jiong Hao Tan, Naresh Kumar, Beng Chin Ooi, James Thomas Patrick Decourcy Hallinan

{"title":"Using deep learning to enhance reporting efficiency and accuracy in degenerative cervical spine MRI.","authors":"Aric Lee, Junran Wu, Changshuo Liu, Andrew Makmur, Yong Han Ting, Shannon Lee, Matthew Ding Zhou Chan, Desmond Shi Wei Lim, Vanessa Mei Hui Khoo, Jonathan Sng, Han Yang Ong, Amos Tan, Shuliang Ge, Faimee Erwan Muhamat Nor, Yi Ting Lim, Joey Chan Yiing Beh, Qai Ven Yap, Jiong Hao Tan, Naresh Kumar, Beng Chin Ooi, James Thomas Patrick Decourcy Hallinan","doi":"10.1016/j.spinee.2025.03.009","DOIUrl":null,"url":null,"abstract":"Background context: Cervical spine MRI is essential for evaluating degenerative cervical spondylosis (DCS) but is time-consuming to report and subject to interobserver variability. The integration of artificial intelligence in medical imaging offers potential solutions to enhance productivity and diagnostic consistency.Purpose: To assess whether a transformer-based deep learning model (DLM) can improve the efficiency and accuracy of radiologists in reporting DCS MRIs.Study design/setting: Retrospective study using external DCS MRIs from December 2015 to August 2018.Patient sample: The test dataset comprised 50 preoperative DCS MRIs (2,555 images) from 50 patients (mean age = 60 years ± SD 14; 13 women [26%]), excluding cases with instrumentation.Outcome measures: Primary outcomes were interpretation time and interobserver agreement (Gwet's kappa) among radiologists grading spinal canal and neural foramina stenosis with and without DLM-assistance.Methods: A transformer-based DLM was used to classify spinal canal (grades 0/1/2/3) and neural foramina (grades 0/1/2) stenosis at each disc level. Two experienced musculoskeletal radiologists (both with 12-years-of-experience) provided reference standard labels in consensus. Ten radiologists (0-7 years of experience) graded DCS MRIs with and without DLM-assistance, with a 1-month washout period between sessions to minimize recall bias. Interpretation time and interobserver agreement were assessed.Results: DLM-assistance significantly improved interpretation time by 69 to 308 s (p<.001), reducing mean time from 159-490 s (SD 27-649) to 90-182 s (SD 42-218). Radiology residents experienced the largest time savings. DLM-assistance improved interobserver agreement across all stenosis gradings compared to baseline. For dichotomous spinal canal grading, residents had the largest improvement in agreement (κ = 0.63 to 0.77, p<.001). Conversely, for dichotomous neural foramina grading, musculoskeletal radiologists had the largest improvement (κ=0.60 to 0.72, p<.001). Notably, independent DLM performance alone was equivalent or superior to all readers.Conclusions: The integration of a deep learning model into the radiological assessment of DCS MRI improved radiologists' interpretation time and interobserver agreement, regardless of experience level.","PeriodicalId":49484,"journal":{"name":"Spine Journal","volume":" ","pages":""},"PeriodicalIF":4.9000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spine Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.spinee.2025.03.009","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

Background context: Cervical spine MRI is essential for evaluating degenerative cervical spondylosis (DCS) but is time-consuming to report and subject to interobserver variability. The integration of artificial intelligence in medical imaging offers potential solutions to enhance productivity and diagnostic consistency.

Purpose: To assess whether a transformer-based deep learning model (DLM) can improve the efficiency and accuracy of radiologists in reporting DCS MRIs.

Study design/setting: Retrospective study using external DCS MRIs from December 2015 to August 2018.

Patient sample: The test dataset comprised 50 preoperative DCS MRIs (2,555 images) from 50 patients (mean age = 60 years ± SD 14; 13 women [26%]), excluding cases with instrumentation.

Outcome measures: Primary outcomes were interpretation time and interobserver agreement (Gwet's kappa) among radiologists grading spinal canal and neural foramina stenosis with and without DLM-assistance.

Methods: A transformer-based DLM was used to classify spinal canal (grades 0/1/2/3) and neural foramina (grades 0/1/2) stenosis at each disc level. Two experienced musculoskeletal radiologists (both with 12-years-of-experience) provided reference standard labels in consensus. Ten radiologists (0-7 years of experience) graded DCS MRIs with and without DLM-assistance, with a 1-month washout period between sessions to minimize recall bias. Interpretation time and interobserver agreement were assessed.

Results: DLM-assistance significantly improved interpretation time by 69 to 308 s (p<.001), reducing mean time from 159-490 s (SD 27-649) to 90-182 s (SD 42-218). Radiology residents experienced the largest time savings. DLM-assistance improved interobserver agreement across all stenosis gradings compared to baseline. For dichotomous spinal canal grading, residents had the largest improvement in agreement (κ = 0.63 to 0.77, p<.001). Conversely, for dichotomous neural foramina grading, musculoskeletal radiologists had the largest improvement (κ=0.60 to 0.72, p<.001). Notably, independent DLM performance alone was equivalent or superior to all readers.

Conclusions: The integration of a deep learning model into the radiological assessment of DCS MRI improved radiologists' interpretation time and interobserver agreement, regardless of experience level.

查看原文本刊更多论文

利用深度学习提高退行性颈椎MRI报告的效率和准确性。

背景情况：颈椎核磁共振成像（MRI）是评估退行性颈椎病（DCS）的必要手段，但报告耗时且受观察者间差异的影响。目的：评估基于变压器的深度学习模型（DLM）能否提高放射科医生报告 DCS MRI 的效率和准确性：回顾性研究，使用2015年12月至2018年8月的外部DCS MRI.患者样本：测试数据集包括50名患者（平均年龄=60岁±SD 14；13名女性（26%））的50张术前DCS MRI（2 555张图像），不包括有器械的病例：主要结果是放射科医生在有 DLM 辅助和无 DLM 辅助的情况下对椎管和神经孔狭窄进行分级的判读时间和观察者之间的一致性（Gwet's kappa）：方法：使用基于变压器的 DLM 对每个椎间盘水平的椎管（0/1/2/3 级）和神经孔（0/1/2 级）狭窄进行分级。两名经验丰富的肌肉骨骼放射科医生（均有 12 年经验）在达成共识的基础上提供了参考标准标签。十名放射科医生（0-7 年经验）在有 DLM 辅助和无 DLM 辅助的情况下对 DCS MRI 进行分级，两次分级之间有 1 个月的缓冲期，以尽量减少回忆偏差。对判读时间和观察者之间的一致性进行了评估：结果：DLM辅助大大缩短了判读时间，缩短了69-308秒（p 结论：将深度学习模型集成到计算机系统中，可以大大缩短判读时间：将深度学习模型整合到 DCS MRI 的放射学评估中可改善放射医师的判读时间和观察者之间的一致性，而与经验水平无关。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Spine Journal 医学-临床神经学

CiteScore

8.20

自引率

6.70%

发文量

680

审稿时长

13.1 weeks

期刊介绍： The Spine Journal, the official journal of the North American Spine Society, is an international and multidisciplinary journal that publishes original, peer-reviewed articles on research and treatment related to the spine and spine care, including basic science and clinical investigations. It is a condition of publication that manuscripts submitted to The Spine Journal have not been published, and will not be simultaneously submitted or published elsewhere. The Spine Journal also publishes major reviews of specific topics by acknowledged authorities, technical notes, teaching editorials, and other special features, Letters to the Editor-in-Chief are encouraged.