AUTOMATING IMAGING BIOMARKER ANALYSIS FOR KNEE OSTEOARTHRITIS USING AN OPEN-SOURCE MRI-BASED DEEP LEARNING PIPELINE

Osteoarthritis imaging Pub Date : 2025-01-01 DOI:10.1016/j.ostima.2025.100288

A. Goyal , F. Belibi , V. Sahani , R. Pedersen , Y. Vainberg , A. Williams , C. Chu , B. Haddock , G. Gold , A.S. Chaudhari , F. Kogan , A.A. Gatti

{"title":"AUTOMATING IMAGING BIOMARKER ANALYSIS FOR KNEE OSTEOARTHRITIS USING AN OPEN-SOURCE MRI-BASED DEEP LEARNING PIPELINE","authors":"A. Goyal , F. Belibi , V. Sahani , R. Pedersen , Y. Vainberg , A. Williams , C. Chu , B. Haddock , G. Gold , A.S. Chaudhari , F. Kogan , A.A. Gatti","doi":"10.1016/j.ostima.2025.100288","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>Quantitative MRI and [¹⁸F]NaF PET enable assessment of cartilage composition, bone shape, and subchondral bone metabolism in knee OA. Current workflows rely on manual segmentation that is time-consuming and subject to inter- and intra-reader variability. Furthermore, computing quantitative metrics requires considerable time and expertise. An open-source, automated, deep learning (DL) pipeline with standardized biomarker extraction has the potential to enhance reproducibility and make large-scale analysis accessible to clinical research communities, including non-technical users.</div></div><div><h3>OBJECTIVE</h3><div>Develop and validate an automated DL-based pipeline for comprehensive MRI-based segmentation and quantitative analysis of multiple knee tissues from multi-modal MR and PET images.</div></div><div><h3>METHODS</h3><div>We developed and open-sourced a comprehensive segmentation and analysis pipeline. A 2D U-Net was trained to segment 9 tissues using a dataset of 347 DESS and qDESS images: 3 bones (femur, tibia, patella), 4 cartilage regions (femoral, medial and lateral tibial, patellar), and 2 menisci (medial and lateral). Subchondral bone masks and femoral cartilage subregions were fitted automatically. Quantitative imaging biomarkers were computed as follows: cartilage T2 was computed analytically from qDESS scans; cartilage thickness was computed as the 3D Euclidean thickness of cartilage overlying the bone surface; meniscal volume was calculated as the product of voxel count and voxel volume; OA bone shape (BScore) was derived using a neural shape model; PET-derived subchondral bone metabolism was computed as regional SUVmean/max, and kinetic modeling via Hawkin’s method was used to extract KiNLR (bone mineralization rate) and K1 (perfusion to subchondral bone). To evaluate the pipeline, 20 unilateral qDESS and [¹⁸F]NaF PET knee scans (10 symptomatic OA, 10 controls) were analyzed by the automated pipeline, and two manual annotators. Manual and automated segmentations were compared using the Dice Similarity Coefficient (DSC) and average symmetric surface distance (ASSD). Biomarkers were compared using ICC and normalized mean RMSE (NRMSE).</div></div><div><h3>RESULTS</h3><div>All automated segmentations had good to excellent overlap measured using DSC (bone: 0.95-0.98; cartilage: 0.84-0.91; menisci: 0.85-0.89) and small surface errors (bone: 0.13-0.32 mm; cartilage: 0.11-0.21 mm; menisci: 0.17-0.30 mm). Notably, automated segmentations had better DSC and ASSD than the inter-rater comparison (Fig. 2). With the exception of cartilage thickness and patellar cartilage whole T2 values, all quantitative metrics showed excellent agreement with ICC >0.96 and NRMSE <0.1, comparable to inter-rater comparison. Bone metrics (BScore, SUV, PET kinetics) had ICC >0.96. Cartilage metrics had more variability, with the best reproducibility for whole cartilage T2 (ICC 0.89-0.98, NRMSE 0.01-0.04), then superficial T2 (ICC 0.93-0.99, NRMSE 0.01-0.05), and finally deep T2 (ICC 0.7-0.97, NRMSE 0.01-0.06). Cartilage thickness showed the worst reproducibility but still was comparable to inter-rater measures. Meniscus volume also shows high concordance (ICC 0.93-0.97; NRMSE 0.05-0.10). Overall, we found that most of the metrics derived from automated segmentations are comparable to those derived from manual segmentations.</div></div><div><h3>CONCLUSION</h3><div>Our open-source, AI-driven pipeline delivers rapid, accurate segmentation and quantitative analysis of multimodal knee MRI and PET data. Next steps include support for other MR sequences, multi-site validation, and 3D Slicer integration to facilitate translation. This resource provides a foundation for reproducible and scalable imaging biomarker analysis in OA research and clinical trials.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100288"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000285","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

INTRODUCTION

Quantitative MRI and [¹⁸F]NaF PET enable assessment of cartilage composition, bone shape, and subchondral bone metabolism in knee OA. Current workflows rely on manual segmentation that is time-consuming and subject to inter- and intra-reader variability. Furthermore, computing quantitative metrics requires considerable time and expertise. An open-source, automated, deep learning (DL) pipeline with standardized biomarker extraction has the potential to enhance reproducibility and make large-scale analysis accessible to clinical research communities, including non-technical users.

OBJECTIVE

Develop and validate an automated DL-based pipeline for comprehensive MRI-based segmentation and quantitative analysis of multiple knee tissues from multi-modal MR and PET images.

METHODS

We developed and open-sourced a comprehensive segmentation and analysis pipeline. A 2D U-Net was trained to segment 9 tissues using a dataset of 347 DESS and qDESS images: 3 bones (femur, tibia, patella), 4 cartilage regions (femoral, medial and lateral tibial, patellar), and 2 menisci (medial and lateral). Subchondral bone masks and femoral cartilage subregions were fitted automatically. Quantitative imaging biomarkers were computed as follows: cartilage T2 was computed analytically from qDESS scans; cartilage thickness was computed as the 3D Euclidean thickness of cartilage overlying the bone surface; meniscal volume was calculated as the product of voxel count and voxel volume; OA bone shape (BScore) was derived using a neural shape model; PET-derived subchondral bone metabolism was computed as regional SUVmean/max, and kinetic modeling via Hawkin’s method was used to extract KiNLR (bone mineralization rate) and K1 (perfusion to subchondral bone). To evaluate the pipeline, 20 unilateral qDESS and [¹⁸F]NaF PET knee scans (10 symptomatic OA, 10 controls) were analyzed by the automated pipeline, and two manual annotators. Manual and automated segmentations were compared using the Dice Similarity Coefficient (DSC) and average symmetric surface distance (ASSD). Biomarkers were compared using ICC and normalized mean RMSE (NRMSE).

RESULTS

All automated segmentations had good to excellent overlap measured using DSC (bone: 0.95-0.98; cartilage: 0.84-0.91; menisci: 0.85-0.89) and small surface errors (bone: 0.13-0.32 mm; cartilage: 0.11-0.21 mm; menisci: 0.17-0.30 mm). Notably, automated segmentations had better DSC and ASSD than the inter-rater comparison (Fig. 2). With the exception of cartilage thickness and patellar cartilage whole T2 values, all quantitative metrics showed excellent agreement with ICC >0.96 and NRMSE <0.1, comparable to inter-rater comparison. Bone metrics (BScore, SUV, PET kinetics) had ICC >0.96. Cartilage metrics had more variability, with the best reproducibility for whole cartilage T2 (ICC 0.89-0.98, NRMSE 0.01-0.04), then superficial T2 (ICC 0.93-0.99, NRMSE 0.01-0.05), and finally deep T2 (ICC 0.7-0.97, NRMSE 0.01-0.06). Cartilage thickness showed the worst reproducibility but still was comparable to inter-rater measures. Meniscus volume also shows high concordance (ICC 0.93-0.97; NRMSE 0.05-0.10). Overall, we found that most of the metrics derived from automated segmentations are comparable to those derived from manual segmentations.

CONCLUSION

Our open-source, AI-driven pipeline delivers rapid, accurate segmentation and quantitative analysis of multimodal knee MRI and PET data. Next steps include support for other MR sequences, multi-site validation, and 3D Slicer integration to facilitate translation. This resource provides a foundation for reproducible and scalable imaging biomarker analysis in OA research and clinical trials.

查看原文本刊更多论文

使用开源的基于核磁共振的深度学习管道对膝关节骨关节炎进行自动成像生物标志物分析

定量MRI和[¹⁸F]NaF PET可以评估膝关节OA患者的软骨组成、骨形状和软骨下骨代谢。当前的工作流程依赖于手动分割，这是耗时的，并受到阅读器之间和内部变化的影响。此外，计算定量度量需要大量的时间和专业知识。具有标准化生物标志物提取的开源、自动化、深度学习（DL）管道有可能提高可重复性，并为临床研究社区（包括非技术用户）提供大规模分析。目的：开发并验证一种自动化的基于dl的管道，用于从多模态MR和PET图像中对多个膝关节组织进行全面的基于mri的分割和定量分析。方法我们开发并开源了一个全面的细分和分析流程。使用347张DESS和qDESS图像数据集训练2D U-Net来分割9个组织：3块骨头（股骨，胫骨，髌骨），4个软骨区域（股骨，胫骨内侧和外侧，髌骨）和2个半月板（内侧和外侧）。软骨下骨面罩和股骨软骨亚区自动拟合。定量成像生物标志物计算如下：软骨T2通过qDESS扫描分析计算；软骨厚度计算为覆盖骨表面的软骨的三维欧几里得厚度；半月板体积计算为体素数与体素体积的乘积；采用神经形态模型推导OA骨形态（BScore）；pet衍生的软骨下骨代谢计算为区域SUVmean/max，通过Hawkin方法进行动力学建模，提取KiNLR（骨矿化率）和K1（软骨下骨灌注）。为了评估管道，我们使用自动管道和2个手动注释器分析了20个单侧qDESS和[¹⁸F]NaF PET膝关节扫描（10个症状性OA， 10个对照组）。使用Dice Similarity Coefficient （DSC）和平均对称表面距离（ASSD）对手动分割和自动分割进行比较。使用ICC和标准化平均RMSE （NRMSE）比较生物标志物。结果DSC测量所有自动分割的重叠度均为良好至优异(骨：0.95 ~ 0.98；软骨:0.84 - -0.91;半月板：0.85-0.89)，表面误差小(骨：0.13-0.32 mm；软骨：0.11-0.21 mm；半月板：0.17-0.30 mm)。值得注意的是，自动分割的DSC和ASSD优于内部比较（图2）。除了软骨厚度和髌骨软骨整体T2值外，所有定量指标均与ICC >；0.96和NRMSE <；0.1非常吻合，与间比较相当。骨骼指标（BScore， SUV， PET动力学）的ICC为0.96。软骨指标的可变性更大，T2全软骨的重现性最好（ICC 0.89-0.98, NRMSE 0.01-0.04），其次是T2浅层（ICC 0.93-0.99, NRMSE 0.01-0.05），最后是T2深层（ICC 0.7-0.97, NRMSE 0.01-0.06）。软骨厚度的再现性最差，但仍可与其他测量方法相媲美。半月板体积也显示出高度的一致性(ICC 0.93-0.97；NRMSE 0.05 - -0.10)。总的来说，我们发现大多数来自自动分割的指标与那些来自手动分割的指标是相当的。我们的开源、人工智能驱动的管道提供快速、准确的多模态膝关节MRI和PET数据分割和定量分析。接下来的步骤包括支持其他MR序列、多位点验证和3D切片器集成以促进翻译。该资源为OA研究和临床试验中可重复和可扩展的成像生物标志物分析提供了基础。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Osteoarthritis imaging Radiology and Imaging

自引率

0.00%

发文量