Lumbar Spinal Stenosis Grading in Multiple Level Magnetic Resonance Imaging Using Deep Convolutional Neural Networks.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
Dongkyu Won, Hyun-Joo Lee, Suk-Joong Lee, Sang Hyun Park
{"title":"Lumbar Spinal Stenosis Grading in Multiple Level Magnetic Resonance Imaging Using Deep Convolutional Neural Networks.","authors":"Dongkyu Won, Hyun-Joo Lee, Suk-Joong Lee, Sang Hyun Park","doi":"10.1177/21925682241299332","DOIUrl":null,"url":null,"abstract":"<p><strong>Study design: </strong>Retrospective magnetic resonance imaging grading with comparison between experts and deep convolutional neural networks (CNNs).</p><p><strong>Objective: </strong>The application of deep learning to clinical diagnosis has gained popularity. This approach can accelerate image interpretation and serve as a screening tool to help doctors.</p><p><strong>Methods: </strong>A comparison was conducted between retrospective magnetic resonance imaging (MRI) grading performed by experts and grading obtained using CNN classifiers. Data were collected from the lumbar axial dataset in the DICOM format. Two experts labeled the sampled images using the same diagnostic tools: localization of patches near the spinal canal, rootlet leveling, and stenosis grading. Comprehensive comparisons were presented for both rootlet cord classification and stenosis grading.</p><p><strong>Results: </strong>Rootlet-cord classification for the two analyzers was 90.3% and the F1 score was 86.6%. The agreement of Analyzers-Classifiers was 92.7% and 96.8% for data with 90.6% and 95.6% F1 scores, respectively. For stenosis grading, there was an agreement of 89.2% between the two analyzers, resulting in an F1 score of 76.5%. The grades of the Analyzers-Classifiers agreed on 91.5/89.4% of the data, with an F1 score of 78.4/75.7%. Analyzer1 and Analyzer2 classified >74% as grade A (78.8% and 74.4%, respectively), 15.4% and 18.6% as grade B, 4.2% and 6.0% as grade C, and 1.6% and 2.0% as grade D, respectively.</p><p><strong>Conclusions: </strong>The fully automated deep learning model showed competitive results in stenosis grade diagnosis and rootlet cord classification under similar anatomical conditions. However, abrupt anatomical changes can lead to a puzzle diagnosis based only on images.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11559735/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/21925682241299332","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

Study design: Retrospective magnetic resonance imaging grading with comparison between experts and deep convolutional neural networks (CNNs).

Objective: The application of deep learning to clinical diagnosis has gained popularity. This approach can accelerate image interpretation and serve as a screening tool to help doctors.

Methods: A comparison was conducted between retrospective magnetic resonance imaging (MRI) grading performed by experts and grading obtained using CNN classifiers. Data were collected from the lumbar axial dataset in the DICOM format. Two experts labeled the sampled images using the same diagnostic tools: localization of patches near the spinal canal, rootlet leveling, and stenosis grading. Comprehensive comparisons were presented for both rootlet cord classification and stenosis grading.

Results: Rootlet-cord classification for the two analyzers was 90.3% and the F1 score was 86.6%. The agreement of Analyzers-Classifiers was 92.7% and 96.8% for data with 90.6% and 95.6% F1 scores, respectively. For stenosis grading, there was an agreement of 89.2% between the two analyzers, resulting in an F1 score of 76.5%. The grades of the Analyzers-Classifiers agreed on 91.5/89.4% of the data, with an F1 score of 78.4/75.7%. Analyzer1 and Analyzer2 classified >74% as grade A (78.8% and 74.4%, respectively), 15.4% and 18.6% as grade B, 4.2% and 6.0% as grade C, and 1.6% and 2.0% as grade D, respectively.

Conclusions: The fully automated deep learning model showed competitive results in stenosis grade diagnosis and rootlet cord classification under similar anatomical conditions. However, abrupt anatomical changes can lead to a puzzle diagnosis based only on images.

利用深度卷积神经网络对多层次磁共振成像中的腰椎管狭窄症进行分级
研究设计回顾性磁共振成像分级,比较专家和深度卷积神经网络(CNN):深度学习在临床诊断中的应用越来越受欢迎。这种方法可以加速图像解读,并作为一种筛查工具帮助医生:对专家进行的回顾性磁共振成像(MRI)分级和使用 CNN 分类器获得的分级进行了比较。数据来自 DICOM 格式的腰椎轴向数据集。两位专家使用相同的诊断工具对采样图像进行了标注:椎管附近斑块定位、小根平整和狭窄分级。结果显示,两位专家的小根脊髓分类和椎管狭窄分级都进行了综合比较:结果:两台分析仪的小根线分类率为 90.3%,F1 分数为 86.6%。对于 F1 分数分别为 90.6% 和 95.6% 的数据,分析仪与分类器的一致性分别为 92.7% 和 96.8%。在狭窄分级方面,两个分析仪的一致性为 89.2%,F1 得分为 76.5%。在 91.5/89.4% 的数据上,分析仪-分类器的分级一致,F1 得分为 78.4/75.7%。分析器1和分析器2将大于74%的数据划分为A级(分别为78.8%和74.4%),15.4%和18.6%的数据划分为B级,4.2%和6.0%的数据划分为C级,1.6%和2.0%的数据划分为D级:在相似的解剖条件下,全自动深度学习模型在狭窄分级诊断和小根索分类方面表现出了很强的竞争力。然而,解剖结构的突然变化可能会导致仅根据图像进行诊断的困惑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信