DEEP LEARNING MODELS FOR AUTOMATIC JOINT SPACE WIDTH MEASUREMENT

Osteoarthritis imaging Pub Date : 2025-01-01 DOI:10.1016/j.ostima.2025.100355

Z. Wang , J. Crawmer , A. Guermazi , J. Duryea , M. Jarraya

{"title":"DEEP LEARNING MODELS FOR AUTOMATIC JOINT SPACE WIDTH MEASUREMENT","authors":"Z. Wang , J. Crawmer , A. Guermazi , J. Duryea , M. Jarraya","doi":"10.1016/j.ostima.2025.100355","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>Accurate and automated measurement of femorotibial JSW (fJSW) is crucial for assessing and monitoring OA. Current semi-automated (SA) fJSW measurement methods can be time-consuming and prone to inter-observer variability. This work describes the evaluation of a deep learning (DL) approach to substantially automate fJSW measurement from knee radiographs.</div></div><div><h3>OBJECTIVE</h3><div>To evaluate the performance of a DL method for automatic fJSW measurement by comparing it to a standard SA method.</div></div><div><h3>METHODS</h3><div>We randomly selected a single knee radiograph from 295 OAI participants (49 knees for each KL grade 0-4) that were not used for DL training. We measured the BL and 48mo. medial fixed-location fJSW at x=0.25 using both the SA and DL methods. fJSW(x=0.25) have been shown to be the most responsive location compared to other fJSW locations and minimum JSW. The SA fJSW measurement consists of a first step to delineate the femur for setting up the necessary coordinate system, followed by a second step to delineate the femur and tibia for measuring fJSW. We trained separate DL algorithms for each step. The models employed an Attention U-Net architecture for segmenting joint spaces. This network enhances the standard U-Net encoder-decoder structure with attention mechanisms. The U-Net's encoder path progressively captures contextual information through a series of convolutional and pooling layers. The decoder path then gradually reconstructs the segmentation map by up-sampling features and combining them with high-resolution features from the encoder via skip connection. To assess performance, we calculated failure rates (assessed visually) for each step, the fJSW<sub>DL</sub> to fJSW<sub>SA</sub> correlation (Pearson’s R), and the responsiveness (standardized response mean: SRM). For DL coordinate system failures, the reader made manual corrections so all knees could be passed to the DL fJSW algorithm.</div></div><div><h3>RESULTS</h3><div>There were 58 coordinate systems failures (11.7%) with a KL distribution as follows: KL0:2, KL1:7, KL2:4, KL3:9, KL4:36, and 31 fJSW (6.2%) failures distributed as follows: KL0:4, KL1:1, KL2:4, KL3:7, KL4:15. We excluded the JSW failures leaving knees from 215 participants for the correlation and responsiveness analyses. The Pearson’s correlation was R = 0.97 and the SRM values were -0.64 (SA) and -0.67 (DL). Figure 1 is a Bland-Altman plot comparing the SA and DL fJSW, showing a minor bias and few outliers.</div></div><div><h3>CONCLUSION</h3><div>The results demonstrate that a DL algorithm can measure fJSW accurately with equivalent or better responsiveness compared to the SA method, dramatically reducing the reader time while maintaining performance. The majority of the failures were for KL4 knees, which are less utilized for KOA studies. The DL software has the potential to be used in very large studies and clinical trials of KOA.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100355"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000959","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

INTRODUCTION

Accurate and automated measurement of femorotibial JSW (fJSW) is crucial for assessing and monitoring OA. Current semi-automated (SA) fJSW measurement methods can be time-consuming and prone to inter-observer variability. This work describes the evaluation of a deep learning (DL) approach to substantially automate fJSW measurement from knee radiographs.

OBJECTIVE

To evaluate the performance of a DL method for automatic fJSW measurement by comparing it to a standard SA method.

METHODS

We randomly selected a single knee radiograph from 295 OAI participants (49 knees for each KL grade 0-4) that were not used for DL training. We measured the BL and 48mo. medial fixed-location fJSW at x=0.25 using both the SA and DL methods. fJSW(x=0.25) have been shown to be the most responsive location compared to other fJSW locations and minimum JSW. The SA fJSW measurement consists of a first step to delineate the femur for setting up the necessary coordinate system, followed by a second step to delineate the femur and tibia for measuring fJSW. We trained separate DL algorithms for each step. The models employed an Attention U-Net architecture for segmenting joint spaces. This network enhances the standard U-Net encoder-decoder structure with attention mechanisms. The U-Net's encoder path progressively captures contextual information through a series of convolutional and pooling layers. The decoder path then gradually reconstructs the segmentation map by up-sampling features and combining them with high-resolution features from the encoder via skip connection. To assess performance, we calculated failure rates (assessed visually) for each step, the fJSW_DL to fJSW_SA correlation (Pearson’s R), and the responsiveness (standardized response mean: SRM). For DL coordinate system failures, the reader made manual corrections so all knees could be passed to the DL fJSW algorithm.

RESULTS

There were 58 coordinate systems failures (11.7%) with a KL distribution as follows: KL0:2, KL1:7, KL2:4, KL3:9, KL4:36, and 31 fJSW (6.2%) failures distributed as follows: KL0:4, KL1:1, KL2:4, KL3:7, KL4:15. We excluded the JSW failures leaving knees from 215 participants for the correlation and responsiveness analyses. The Pearson’s correlation was R = 0.97 and the SRM values were -0.64 (SA) and -0.67 (DL). Figure 1 is a Bland-Altman plot comparing the SA and DL fJSW, showing a minor bias and few outliers.

CONCLUSION

The results demonstrate that a DL algorithm can measure fJSW accurately with equivalent or better responsiveness compared to the SA method, dramatically reducing the reader time while maintaining performance. The majority of the failures were for KL4 knees, which are less utilized for KOA studies. The DL software has the potential to be used in very large studies and clinical trials of KOA.

查看原文本刊更多论文

关节空间宽度自动测量的深度学习模型

准确和自动测量股胫JSW （fJSW）对于评估和监测OA至关重要。当前的半自动（SA） fJSW测量方法可能非常耗时，并且容易在观察者之间发生变化。这项工作描述了一种深度学习（DL）方法的评估，该方法基本上自动化了膝关节x线片的fJSW测量。目的通过与标准SA法的比较，评价DL法自动测量fJSW的性能。方法：我们从295名OAI参与者（每个KL等级为0-4级，49个膝关节）中随机选择一张未用于DL训练的膝关节x线片。我们测量了BL和48mo。在x=0.25时，使用SA和DL方法测量内侧固定位置fJSW。与其他fJSW位置相比，fJSW（x=0.25）是最敏感的位置，也是最小的JSW位置。SA fJSW测量包括第一步描绘股骨以建立必要的坐标系，第二步描绘股骨和胫骨以测量fJSW。我们为每一步训练单独的深度学习算法。该模型采用了Attention U-Net架构对关节空间进行分割。该网络通过注意机制增强了标准的U-Net编码器-解码器结构。U-Net的编码器路径通过一系列卷积层和池化层逐步捕获上下文信息。然后，解码器路径通过上采样特征逐步重建分割图，并通过跳过连接将其与编码器的高分辨率特征结合起来。为了评估性能，我们计算了每个步骤的故障率（目测）、fJSWDL与fJSWSA的相关性（Pearson’s R）和响应性（标准化响应平均值：SRM）。对于DL坐标系统故障，阅读器进行手动修正，以便将所有膝盖都传递给DL fJSW算法。结果共发生58例（11.7%）坐标系故障，KL分布为：KL0:2、KL1:7、KL2:4、KL3:9、KL4:36；发生31例（6.2%）fJSW故障，KL分布为：KL0:4、KL1:1、KL2:4、KL3:7、KL4:15。我们从215名参与者中排除了JSW故障，进行相关性和响应性分析。Pearson相关R = 0.97，SRM值分别为-0.64 （SA）和-0.67 （DL）。图1是比较jsw的SA和DL的Bland-Altman图，显示了轻微的偏差和少数异常值。结论DL算法可以准确地测量fJSW，与SA方法相比，其响应速度相当或更好，在保持性能的同时显著减少了读取时间。大多数失败的是KL4膝关节，很少用于KOA研究。DL软件有潜力用于KOA的大型研究和临床试验。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Osteoarthritis imaging Radiology and Imaging

自引率

0.00%

发文量