Aric Lee, Junran Wu, Changshuo Liu, Andrew Makmur, Yong Han Ting, Shannon Lee, Matthew Ding Zhou Chan, Desmond Shi Wei Lim, Vanessa Mei Hui Khoo, Jonathan Sng, Han Yang Ong, Amos Tan, Shuliang Ge, Faimee Erwan Muhamat Nor, Yi Ting Lim, Joey Chan Yiing Beh, Qai Ven Yap, Jiong Hao Tan, Naresh Kumar, Beng Chin Ooi, James Thomas Patrick Decourcy Hallinan
{"title":"Using Deep Learning to Enhance Reporting Efficiency and Accuracy in Degenerative Cervical Spine MRI.","authors":"Aric Lee, Junran Wu, Changshuo Liu, Andrew Makmur, Yong Han Ting, Shannon Lee, Matthew Ding Zhou Chan, Desmond Shi Wei Lim, Vanessa Mei Hui Khoo, Jonathan Sng, Han Yang Ong, Amos Tan, Shuliang Ge, Faimee Erwan Muhamat Nor, Yi Ting Lim, Joey Chan Yiing Beh, Qai Ven Yap, Jiong Hao Tan, Naresh Kumar, Beng Chin Ooi, James Thomas Patrick Decourcy Hallinan","doi":"10.1016/j.spinee.2025.03.009","DOIUrl":null,"url":null,"abstract":"<p><strong>Background context: </strong>Cervical spine MRI is essential for evaluating degenerative cervical spondylosis (DCS) but is time-consuming to report and subject to interobserver variability. The integration of artificial intelligence in medical imaging offers potential solutions to enhance productivity and diagnostic consistency.</p><p><strong>Purpose: </strong>To assess whether a transformer-based deep learning model (DLM) can improve the efficiency and accuracy of radiologists in reporting DCS MRIs.</p><p><strong>Study design/setting: </strong>Retrospective study using external DCS MRIs from December 2015 to August 2018.</p><p><strong>Patient sample: </strong>The test dataset comprised 50 pre-operative DCS MRIs (2,555 images) from 50 patients (mean age = 60 years ± SD 14; 13 women (26%)), excluding cases with instrumentation.</p><p><strong>Outcome measures: </strong>Primary outcomes were interpretation time and interobserver agreement (Gwet's kappa) among radiologists grading spinal canal and neural foramina stenosis with and without DLM-assistance.</p><p><strong>Methods: </strong>A transformer-based DLM was used to classify spinal canal (grades 0/1/2/3) and neural foramina (grades 0/1/2) stenosis at each disc level. Two experienced musculoskeletal radiologists (both with 12-years-of-experience) provided reference standard labels in consensus. Ten radiologists (0-7 years of experience) graded DCS MRIs with and without DLM-assistance, with a 1-month washout period between sessions to minimize recall bias. Interpretation time and interobserver agreement were assessed.</p><p><strong>Results: </strong>DLM-assistance significantly improved interpretation time by 69-308 seconds (p < 0.001), reducing mean time from 159-490 seconds (SD 27-649) to 90-182 seconds (SD 42-218). Radiology residents experienced the largest time savings. DLM-assistance improved interobserver agreement across all stenosis gradings compared to baseline. For dichotomous spinal canal grading, residents had the largest improvement in agreement (κ = 0.63 to 0.77, p < 0.001). Conversely, for dichotomous neural foramina grading, musculoskeletal radiologists had the largest improvement (κ = 0.60 to 0.72, p < 0.001). Notably, independent DLM performance alone was equivalent or superior to all readers.</p><p><strong>Conclusions: </strong>The integration of a deep learning model into the radiological assessment of DCS MRI improved radiologists' interpretation time and interobserver agreement, regardless of experience level.</p>","PeriodicalId":49484,"journal":{"name":"Spine Journal","volume":" ","pages":""},"PeriodicalIF":4.9000,"publicationDate":"2025-03-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Spine Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.spinee.2025.03.009","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background context: Cervical spine MRI is essential for evaluating degenerative cervical spondylosis (DCS) but is time-consuming to report and subject to interobserver variability. The integration of artificial intelligence in medical imaging offers potential solutions to enhance productivity and diagnostic consistency.
Purpose: To assess whether a transformer-based deep learning model (DLM) can improve the efficiency and accuracy of radiologists in reporting DCS MRIs.
Study design/setting: Retrospective study using external DCS MRIs from December 2015 to August 2018.
Patient sample: The test dataset comprised 50 pre-operative DCS MRIs (2,555 images) from 50 patients (mean age = 60 years ± SD 14; 13 women (26%)), excluding cases with instrumentation.
Outcome measures: Primary outcomes were interpretation time and interobserver agreement (Gwet's kappa) among radiologists grading spinal canal and neural foramina stenosis with and without DLM-assistance.
Methods: A transformer-based DLM was used to classify spinal canal (grades 0/1/2/3) and neural foramina (grades 0/1/2) stenosis at each disc level. Two experienced musculoskeletal radiologists (both with 12-years-of-experience) provided reference standard labels in consensus. Ten radiologists (0-7 years of experience) graded DCS MRIs with and without DLM-assistance, with a 1-month washout period between sessions to minimize recall bias. Interpretation time and interobserver agreement were assessed.
Results: DLM-assistance significantly improved interpretation time by 69-308 seconds (p < 0.001), reducing mean time from 159-490 seconds (SD 27-649) to 90-182 seconds (SD 42-218). Radiology residents experienced the largest time savings. DLM-assistance improved interobserver agreement across all stenosis gradings compared to baseline. For dichotomous spinal canal grading, residents had the largest improvement in agreement (κ = 0.63 to 0.77, p < 0.001). Conversely, for dichotomous neural foramina grading, musculoskeletal radiologists had the largest improvement (κ = 0.60 to 0.72, p < 0.001). Notably, independent DLM performance alone was equivalent or superior to all readers.
Conclusions: The integration of a deep learning model into the radiological assessment of DCS MRI improved radiologists' interpretation time and interobserver agreement, regardless of experience level.
期刊介绍:
The Spine Journal, the official journal of the North American Spine Society, is an international and multidisciplinary journal that publishes original, peer-reviewed articles on research and treatment related to the spine and spine care, including basic science and clinical investigations. It is a condition of publication that manuscripts submitted to The Spine Journal have not been published, and will not be simultaneously submitted or published elsewhere. The Spine Journal also publishes major reviews of specific topics by acknowledged authorities, technical notes, teaching editorials, and other special features, Letters to the Editor-in-Chief are encouraged.