对用于腰骶椎盘病变分级的一整套放射学特征进行 SpineNetV2 外部验证

Q3 Medicine
Alemu Sisay Nigru MSc , Sergio Benini PhD , Matteo Bonetti MD , Graziella Bragaglio MSc , Michele Frigerio MD , Federico Maffezzoni MSc , Riccardo Leonardi PhD
{"title":"对用于腰骶椎盘病变分级的一整套放射学特征进行 SpineNetV2 外部验证","authors":"Alemu Sisay Nigru MSc ,&nbsp;Sergio Benini PhD ,&nbsp;Matteo Bonetti MD ,&nbsp;Graziella Bragaglio MSc ,&nbsp;Michele Frigerio MD ,&nbsp;Federico Maffezzoni MSc ,&nbsp;Riccardo Leonardi PhD","doi":"10.1016/j.xnsj.2024.100564","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>In recent years, the integration of Artificial Intelligence (AI) models has revolutionized the diagnosis of Low Back Pain (LBP) and associated disc pathologies. Among these, SpineNetV2 stands out as a state-of-the-art, open-access model for detecting and grading various intervertebral disc pathologies. However, ensuring the reliability and applicability of AI models like SpineNetV2 is paramount. Rigorous validation is essential to guarantee their robustness and generalizability across diverse patient cohorts and imaging protocols.</div></div><div><h3>Methods</h3><div>We conducted a retrospective analysis of MRI images of 1747 lumbosacral intervertebral discs (IVDs) from 353 patients (mean age, 54 ± 15.4 years, 44.5% female) with various spinal disorders, collected between September 2021 and February 2023 at X-Ray Service s.r.l. The SpineNetV2 system was used to grade 11 distinct lumbosacral disc pathologies, including Pfirrmann grading, disc narrowing, central canal stenosis, spondylolisthesis, (upper and lower) endplate defects, (upper and lower) marrow changes, (right and left) foraminal stenosis, and disc herniation, using T2-weighted sagittal MR images. Performance metrics included accuracy, balanced accuracy, precision, F1 score, Matthew's Correlation Coefficient, Brier Score Loss, Lin's concordance correlation coefficients, and Cohen's kappa coefficients. Two expert radiologists provide annotations for these discs. The evaluation of SpineNetV2′s grading is compared against expert radiologists' assessments.</div></div><div><h3>Results</h3><div>SpineNetV2 demonstrated strong performance across various metrics, with high agreement scores (Cohen's Kappa, Lin's Concordance, and Matthew's Correlation Coefficient exceeding 0.7) for most pathologies. However, lower agreement was found for foraminal stenosis and disc herniation, underscoring the limitations of sagittal MR images for evaluating these conditions.</div></div><div><h3>Conclusions</h3><div>This study highlights the importance of external validation, emphasizing the need for comprehensive assessments of deep learning models. SpineNetV2 exhibits promising results in predicting disc pathologies, with findings guiding further improvements. The open-source release of SpineNetV2 enables researchers to independently validate and extend the model's capabilities. This collaborative approach promotes innovation and accelerates the development of more reliable and comprehensive deep learning tools for the assessment of spine pathology.</div></div>","PeriodicalId":34622,"journal":{"name":"North American Spine Society Journal","volume":"20 ","pages":"Article 100564"},"PeriodicalIF":0.0000,"publicationDate":"2024-10-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"External validation of SpineNetV2 on a comprehensive set of radiological features for grading lumbosacral disc pathologies\",\"authors\":\"Alemu Sisay Nigru MSc ,&nbsp;Sergio Benini PhD ,&nbsp;Matteo Bonetti MD ,&nbsp;Graziella Bragaglio MSc ,&nbsp;Michele Frigerio MD ,&nbsp;Federico Maffezzoni MSc ,&nbsp;Riccardo Leonardi PhD\",\"doi\":\"10.1016/j.xnsj.2024.100564\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Background</h3><div>In recent years, the integration of Artificial Intelligence (AI) models has revolutionized the diagnosis of Low Back Pain (LBP) and associated disc pathologies. Among these, SpineNetV2 stands out as a state-of-the-art, open-access model for detecting and grading various intervertebral disc pathologies. However, ensuring the reliability and applicability of AI models like SpineNetV2 is paramount. Rigorous validation is essential to guarantee their robustness and generalizability across diverse patient cohorts and imaging protocols.</div></div><div><h3>Methods</h3><div>We conducted a retrospective analysis of MRI images of 1747 lumbosacral intervertebral discs (IVDs) from 353 patients (mean age, 54 ± 15.4 years, 44.5% female) with various spinal disorders, collected between September 2021 and February 2023 at X-Ray Service s.r.l. The SpineNetV2 system was used to grade 11 distinct lumbosacral disc pathologies, including Pfirrmann grading, disc narrowing, central canal stenosis, spondylolisthesis, (upper and lower) endplate defects, (upper and lower) marrow changes, (right and left) foraminal stenosis, and disc herniation, using T2-weighted sagittal MR images. Performance metrics included accuracy, balanced accuracy, precision, F1 score, Matthew's Correlation Coefficient, Brier Score Loss, Lin's concordance correlation coefficients, and Cohen's kappa coefficients. Two expert radiologists provide annotations for these discs. The evaluation of SpineNetV2′s grading is compared against expert radiologists' assessments.</div></div><div><h3>Results</h3><div>SpineNetV2 demonstrated strong performance across various metrics, with high agreement scores (Cohen's Kappa, Lin's Concordance, and Matthew's Correlation Coefficient exceeding 0.7) for most pathologies. However, lower agreement was found for foraminal stenosis and disc herniation, underscoring the limitations of sagittal MR images for evaluating these conditions.</div></div><div><h3>Conclusions</h3><div>This study highlights the importance of external validation, emphasizing the need for comprehensive assessments of deep learning models. SpineNetV2 exhibits promising results in predicting disc pathologies, with findings guiding further improvements. The open-source release of SpineNetV2 enables researchers to independently validate and extend the model's capabilities. This collaborative approach promotes innovation and accelerates the development of more reliable and comprehensive deep learning tools for the assessment of spine pathology.</div></div>\",\"PeriodicalId\":34622,\"journal\":{\"name\":\"North American Spine Society Journal\",\"volume\":\"20 \",\"pages\":\"Article 100564\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-10-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"North American Spine Society Journal\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666548424002579\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"Medicine\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"North American Spine Society Journal","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666548424002579","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

摘要

背景近年来,人工智能(AI)模型的集成彻底改变了腰背痛(LBP)和相关椎间盘病变的诊断。其中,SpineNetV2 在检测和分级各种椎间盘病变方面脱颖而出,成为最先进的开放式模型。然而,确保 SpineNetV2 等人工智能模型的可靠性和适用性至关重要。我们对 2021 年 9 月至 2023 年 2 月期间在 X-Ray Service s.r.l.收集的 353 名患有各种脊柱疾病的患者(平均年龄为 54 ± 15.4 岁,44.5% 为女性)的 1747 个腰骶椎间盘(IVD)的 MRI 图像进行了回顾性分析。SpineNetV2 系统利用 T2 加权矢状磁共振图像对 11 种不同的腰骶椎间盘病变进行分级,包括 Pfirrmann 分级、椎间盘狭窄、中央管狭窄、椎体滑脱、(上下)终板缺损、(上下)骨髓改变、(左右)椎孔狭窄和椎间盘突出。性能指标包括准确度、平衡准确度、精确度、F1 评分、马修相关系数、布赖尔评分损失、林氏一致性相关系数和科恩卡帕系数。两名放射科专家为这些椎间盘提供注释。结果SpineNetV2在各种指标上都表现出色,在大多数病理上都有很高的一致性得分(Cohen's Kappa、Lin's Concordance和Matthew's Correlation Coefficient均超过0.7)。然而,椎管狭窄和椎间盘突出症的一致性较低,这突出表明矢状位磁共振图像在评估这些病症时存在局限性。结论这项研究突出了外部验证的重要性,强调了对深度学习模型进行全面评估的必要性。SpineNetV2 在预测椎间盘病变方面取得了可喜的成果,研究结果为进一步改进提供了指导。SpineNetV2 的开源发布使研究人员能够独立验证和扩展模型的功能。这种合作方式促进了创新,加快了用于脊柱病理学评估的更可靠、更全面的深度学习工具的开发。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
External validation of SpineNetV2 on a comprehensive set of radiological features for grading lumbosacral disc pathologies

Background

In recent years, the integration of Artificial Intelligence (AI) models has revolutionized the diagnosis of Low Back Pain (LBP) and associated disc pathologies. Among these, SpineNetV2 stands out as a state-of-the-art, open-access model for detecting and grading various intervertebral disc pathologies. However, ensuring the reliability and applicability of AI models like SpineNetV2 is paramount. Rigorous validation is essential to guarantee their robustness and generalizability across diverse patient cohorts and imaging protocols.

Methods

We conducted a retrospective analysis of MRI images of 1747 lumbosacral intervertebral discs (IVDs) from 353 patients (mean age, 54 ± 15.4 years, 44.5% female) with various spinal disorders, collected between September 2021 and February 2023 at X-Ray Service s.r.l. The SpineNetV2 system was used to grade 11 distinct lumbosacral disc pathologies, including Pfirrmann grading, disc narrowing, central canal stenosis, spondylolisthesis, (upper and lower) endplate defects, (upper and lower) marrow changes, (right and left) foraminal stenosis, and disc herniation, using T2-weighted sagittal MR images. Performance metrics included accuracy, balanced accuracy, precision, F1 score, Matthew's Correlation Coefficient, Brier Score Loss, Lin's concordance correlation coefficients, and Cohen's kappa coefficients. Two expert radiologists provide annotations for these discs. The evaluation of SpineNetV2′s grading is compared against expert radiologists' assessments.

Results

SpineNetV2 demonstrated strong performance across various metrics, with high agreement scores (Cohen's Kappa, Lin's Concordance, and Matthew's Correlation Coefficient exceeding 0.7) for most pathologies. However, lower agreement was found for foraminal stenosis and disc herniation, underscoring the limitations of sagittal MR images for evaluating these conditions.

Conclusions

This study highlights the importance of external validation, emphasizing the need for comprehensive assessments of deep learning models. SpineNetV2 exhibits promising results in predicting disc pathologies, with findings guiding further improvements. The open-source release of SpineNetV2 enables researchers to independently validate and extend the model's capabilities. This collaborative approach promotes innovation and accelerates the development of more reliable and comprehensive deep learning tools for the assessment of spine pathology.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
1.80
自引率
0.00%
发文量
71
审稿时长
48 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信