Deep Learning-Assisted Quantitative Measurement of Thoracolumbar Fracture Features on Lateral Radiographs

IF 3.8 2区 医学 Q1 CLINICAL NEUROLOGY
Neurospine Pub Date : 2024-03-01 DOI:10.14245/ns.2347366.683
W. Yuh, E. Khil, Y. Yoon, Burnyoung Kim, Hongjun Yoon, Jihe Lim, Kyoung Yeon Lee, Yeong Seo Yoo, Kyeong Deuk An
{"title":"Deep Learning-Assisted Quantitative Measurement of Thoracolumbar Fracture Features on Lateral Radiographs","authors":"W. Yuh, E. Khil, Y. Yoon, Burnyoung Kim, Hongjun Yoon, Jihe Lim, Kyoung Yeon Lee, Yeong Seo Yoo, Kyeong Deuk An","doi":"10.14245/ns.2347366.683","DOIUrl":null,"url":null,"abstract":"Objective This study aimed to develop and validate a deep learning (DL) algorithm for the quantitative measurement of thoracolumbar (TL) fracture features, and to evaluate its efficacy across varying levels of clinical expertise. Methods Using the pretrained Mask Region-Based Convolutional Neural Networks model, originally developed for vertebral body segmentation and fracture detection, we fine-tuned the model and added a new module for measuring fracture metrics—compression rate (CR), Cobb angle (CA), Gardner angle (GA), and sagittal index (SI)—from lumbar spine lateral radiographs. These metrics were derived from six-point labeling by 3 radiologists, forming the ground truth (GT). Training utilized 1,000 nonfractured and 318 fractured radiographs, while validations employed 213 internal and 200 external fractured radiographs. The accuracy of the DL algorithm in quantifying fracture features was evaluated against GT using the intraclass correlation coefficient. Additionally, 4 readers with varying expertise levels, including trainees and an attending spine surgeon, performed measurements with and without DL assistance, and their results were compared to GT and the DL model. Results The DL algorithm demonstrated good to excellent agreement with GT for CR, CA, GA, and SI in both internal (0.860, 0.944, 0.932, and 0.779, respectively) and external (0.836, 0.940, 0.916, and 0.815, respectively) validations. DL-assisted measurements significantly improved most measurement values, particularly for trainees. Conclusion The DL algorithm was validated as an accurate tool for quantifying TL fracture features using radiographs. DL-assisted measurement is expected to expedite the diagnostic process and enhance reliability, particularly benefiting less experienced clinicians.","PeriodicalId":19269,"journal":{"name":"Neurospine","volume":null,"pages":null},"PeriodicalIF":3.8000,"publicationDate":"2024-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neurospine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.14245/ns.2347366.683","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 1

Abstract

Objective This study aimed to develop and validate a deep learning (DL) algorithm for the quantitative measurement of thoracolumbar (TL) fracture features, and to evaluate its efficacy across varying levels of clinical expertise. Methods Using the pretrained Mask Region-Based Convolutional Neural Networks model, originally developed for vertebral body segmentation and fracture detection, we fine-tuned the model and added a new module for measuring fracture metrics—compression rate (CR), Cobb angle (CA), Gardner angle (GA), and sagittal index (SI)—from lumbar spine lateral radiographs. These metrics were derived from six-point labeling by 3 radiologists, forming the ground truth (GT). Training utilized 1,000 nonfractured and 318 fractured radiographs, while validations employed 213 internal and 200 external fractured radiographs. The accuracy of the DL algorithm in quantifying fracture features was evaluated against GT using the intraclass correlation coefficient. Additionally, 4 readers with varying expertise levels, including trainees and an attending spine surgeon, performed measurements with and without DL assistance, and their results were compared to GT and the DL model. Results The DL algorithm demonstrated good to excellent agreement with GT for CR, CA, GA, and SI in both internal (0.860, 0.944, 0.932, and 0.779, respectively) and external (0.836, 0.940, 0.916, and 0.815, respectively) validations. DL-assisted measurements significantly improved most measurement values, particularly for trainees. Conclusion The DL algorithm was validated as an accurate tool for quantifying TL fracture features using radiographs. DL-assisted measurement is expected to expedite the diagnostic process and enhance reliability, particularly benefiting less experienced clinicians.
深度学习辅助定量测量侧位片上的胸腰椎骨折特征
目的 本研究旨在开发和验证一种用于定量测量胸腰椎(TL)骨折特征的深度学习(DL)算法,并评估其在不同临床专业知识水平下的功效。方法 利用最初为椎体分割和骨折检测而开发的预训练掩模区域卷积神经网络模型,我们对模型进行了微调,并添加了一个新模块,用于测量腰椎侧位片的骨折指标--压缩率(CR)、科布角(CA)、加德纳角(GA)和矢状位指数(SI)。这些指标是由 3 位放射科医生通过六点标注得出的,形成了地面实况(GT)。训练使用了 1,000 张非骨折和 318 张骨折X光片,验证使用了 213 张内骨折和 200 张外骨折X光片。使用类内相关系数评估了 DL 算法与 GT 在量化骨折特征方面的准确性。此外,4 名具有不同专业水平的读者(包括受训者和一名主治脊柱外科医生)在有 DL 辅助和无 DL 辅助的情况下进行了测量,并将他们的测量结果与 GT 和 DL 模型进行了比较。结果 在内部(分别为 0.860、0.944、0.932 和 0.779)和外部(分别为 0.836、0.940、0.916 和 0.815)验证中,DL 算法在 CR、CA、GA 和 SI 方面与 GT 的一致性良好到极佳。DL 辅助测量显著改善了大多数测量值,尤其是对受训者而言。结论 经验证,DL 算法是使用 X 光片量化 TL 骨折特征的准确工具。DL 辅助测量有望加快诊断过程并提高可靠性,尤其有利于经验不足的临床医生。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Neurospine
Neurospine Multiple-
CiteScore
5.80
自引率
18.80%
发文量
93
审稿时长
10 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信