REPEATABILITY OF THE CT OSTEOARTHRITIS KNEE SCORE (COAKS) AND A PROTOTYPE CT-GENERATED KELLGREN AND LAWRENCE GRADE

T.D. Turmezei , A. Boddu , Z. Akkaya , N.H. Degala , J.A. Lynch , N.A. Segal
{"title":"REPEATABILITY OF THE CT OSTEOARTHRITIS KNEE SCORE (COAKS) AND A PROTOTYPE CT-GENERATED KELLGREN AND LAWRENCE GRADE","authors":"T.D. Turmezei ,&nbsp;A. Boddu ,&nbsp;Z. Akkaya ,&nbsp;N.H. Degala ,&nbsp;J.A. Lynch ,&nbsp;N.A. Segal","doi":"10.1016/j.ostima.2025.100346","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>The CT Osteoarthritis Knee Score (COAKS) is a semiquantitative system for grading structural disease features of knee OA from weight bearing CT (WBCT). Previous work has demonstrated substantial to near-perfect inter- and intra-observer reliability of COAKS with the aid of a feature scoring atlas, but test-retest repeatability has not yet been evaluated. Given that x-ray and CT rely on the same fundamental physical properties, COAKS could also be harnessed to provide a CT-generated KLG and avoid the need for radiographic imaging.</div></div><div><h3>OBJECTIVE</h3><div>(1) To evaluate test-retest repeatability of COAKS; (2) to develop a CT-generated KLG (ctKLG) and evaluate its test-retest repeatability; and (3) to compare this prototype ctKLG against radiographic KLG (rKLG).</div></div><div><h3>METHODS</h3><div>14 individuals recruited and consented at the University of Kansas Medical Center had baseline and follow-up WBCT imaging suitable for analysis. Participant demographics were: mean ± SD age 61.3 ± 8.4 years, BMI 30.7 ± 4.3 kg/m<sup>2</sup> and male:female ratio 8:6. All scanning was performed on the same XFI WBCT scanner (Planmed Oy, Helsinki, Finland) with the mean ± SD interval between baseline and follow-up attendances 14.9 ± 8.1 days. A Synaflexer<sup>TM</sup> device was used to standardize knee positioning during scanning. Imaging acquisition parameters were 96 kV tube voltage, 51.4 mA tube current, 3.5 s exposure time. A standard bone algorithm was applied for reconstruction with 0.3 mm isotropic voxels and a 21 cm vertical scan range. All scans were anonymized prior to analysis both according to the individual and imaging attendance. All knees were reviewed for their COAKS by an experienced musculoskeletal radiologist (T.D.T.). Scores were recorded in a cloud-based file on Google Sheets (alongside the feature atlas in Google Docs) and read by a custom MATLAB script to generate structural heat maps. Test-retest repeatability weighted Kappa (Kw) scores were calculated for each feature (J = JSW; O = osteophytes; C = subchondral cysts; S = subchondral sclerosis) at each compartment (MTF = medial tibiofemoral; LTF = lateral tibiofemoral; PF = patellofemoral; PTF = proximal tibiofibular). A custom MATLAB script applied a decision tree based on recognized KLG verbal definitions to generate ctKLGs for each knee, including a combined score for the MTF and LTF compartments to mimic single-view AP radiographic conditions. A second experienced musculoskeletal radiologist (Z.A.) read study inclusion radiographs for rKLG likewise blinded. Kw was also calculated for ctKLG and rKLG.</div></div><div><h3>RESULTS</h3><div>Structural heatmaps are shown in Figure 1 for participants with ctKLGs of 1 (study ID 117, right knee) and 4 (study ID 101, right knee) alongside difference maps (follow-up minus baseline). These maps give examples of minimal difference in baseline and follow-up grading at the extremes of structural disease. Best repeatability by feature and compartment was for JSW at the MTF compartment with a Kw (95% CI) of 0.94 (0.93-0.96) and for osteophytes at the PF compartment with a Kw of 0.91 (0.90-0.93). Repeatability for each feature across all compartments was near-perfect (0.82 and above), except being substantial for subchondral sclerosis (0.72, 0.69-0.74). Repeatability for ctKLG was substantial for all individual compartments and near-perfect for the combined tibiofemoral score (0.83,0.80-0.86). Kw was likewise near perfect for rKLG (0.90, 0.88-0.93). Full repeatability Kw (95% CI) results for COAKS, prototype ctKLG and rKLG are given in Tables 1 &amp; 2. It was noted that ctKLG scores were sensitive to small changes in the decision tree based on verbal rKLG interpretation.</div></div><div><h3>CONCLUSION</h3><div>COAKS repeatability results were similar to those previously demonstrated for intra-observer rating, suggesting that scan factors are consistent enough to have little effect on reader performance. A ctKLG derived from COAKS was similarly repeatable to rKLG with near-perfect performance. A ctKLG model offers a means to stratify structural disease from WBCT without the need for radiographs, however further development is needed to establish a robust decision tree in deriving this from COAKS.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100346"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000868","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

INTRODUCTION

The CT Osteoarthritis Knee Score (COAKS) is a semiquantitative system for grading structural disease features of knee OA from weight bearing CT (WBCT). Previous work has demonstrated substantial to near-perfect inter- and intra-observer reliability of COAKS with the aid of a feature scoring atlas, but test-retest repeatability has not yet been evaluated. Given that x-ray and CT rely on the same fundamental physical properties, COAKS could also be harnessed to provide a CT-generated KLG and avoid the need for radiographic imaging.

OBJECTIVE

(1) To evaluate test-retest repeatability of COAKS; (2) to develop a CT-generated KLG (ctKLG) and evaluate its test-retest repeatability; and (3) to compare this prototype ctKLG against radiographic KLG (rKLG).

METHODS

14 individuals recruited and consented at the University of Kansas Medical Center had baseline and follow-up WBCT imaging suitable for analysis. Participant demographics were: mean ± SD age 61.3 ± 8.4 years, BMI 30.7 ± 4.3 kg/m2 and male:female ratio 8:6. All scanning was performed on the same XFI WBCT scanner (Planmed Oy, Helsinki, Finland) with the mean ± SD interval between baseline and follow-up attendances 14.9 ± 8.1 days. A SynaflexerTM device was used to standardize knee positioning during scanning. Imaging acquisition parameters were 96 kV tube voltage, 51.4 mA tube current, 3.5 s exposure time. A standard bone algorithm was applied for reconstruction with 0.3 mm isotropic voxels and a 21 cm vertical scan range. All scans were anonymized prior to analysis both according to the individual and imaging attendance. All knees were reviewed for their COAKS by an experienced musculoskeletal radiologist (T.D.T.). Scores were recorded in a cloud-based file on Google Sheets (alongside the feature atlas in Google Docs) and read by a custom MATLAB script to generate structural heat maps. Test-retest repeatability weighted Kappa (Kw) scores were calculated for each feature (J = JSW; O = osteophytes; C = subchondral cysts; S = subchondral sclerosis) at each compartment (MTF = medial tibiofemoral; LTF = lateral tibiofemoral; PF = patellofemoral; PTF = proximal tibiofibular). A custom MATLAB script applied a decision tree based on recognized KLG verbal definitions to generate ctKLGs for each knee, including a combined score for the MTF and LTF compartments to mimic single-view AP radiographic conditions. A second experienced musculoskeletal radiologist (Z.A.) read study inclusion radiographs for rKLG likewise blinded. Kw was also calculated for ctKLG and rKLG.

RESULTS

Structural heatmaps are shown in Figure 1 for participants with ctKLGs of 1 (study ID 117, right knee) and 4 (study ID 101, right knee) alongside difference maps (follow-up minus baseline). These maps give examples of minimal difference in baseline and follow-up grading at the extremes of structural disease. Best repeatability by feature and compartment was for JSW at the MTF compartment with a Kw (95% CI) of 0.94 (0.93-0.96) and for osteophytes at the PF compartment with a Kw of 0.91 (0.90-0.93). Repeatability for each feature across all compartments was near-perfect (0.82 and above), except being substantial for subchondral sclerosis (0.72, 0.69-0.74). Repeatability for ctKLG was substantial for all individual compartments and near-perfect for the combined tibiofemoral score (0.83,0.80-0.86). Kw was likewise near perfect for rKLG (0.90, 0.88-0.93). Full repeatability Kw (95% CI) results for COAKS, prototype ctKLG and rKLG are given in Tables 1 & 2. It was noted that ctKLG scores were sensitive to small changes in the decision tree based on verbal rKLG interpretation.

CONCLUSION

COAKS repeatability results were similar to those previously demonstrated for intra-observer rating, suggesting that scan factors are consistent enough to have little effect on reader performance. A ctKLG derived from COAKS was similarly repeatable to rKLG with near-perfect performance. A ctKLG model offers a means to stratify structural disease from WBCT without the need for radiographs, however further development is needed to establish a robust decision tree in deriving this from COAKS.
ct骨关节炎膝关节评分(coaks)和原型ct生成的kelgren和Lawrence分级的可重复性
CT骨关节炎膝关节评分(COAKS)是一种半定量系统,用于从负重CT (WBCT)对膝关节OA的结构性疾病特征进行分级。先前的工作已经证明了COAKS在特征评分图谱的帮助下,在观察者之间和观察者内部具有近乎完美的可靠性,但测试-重复测试的可重复性尚未得到评估。考虑到x射线和CT依赖于相同的基本物理性质,COAKS也可以用来提供CT生成的KLG,从而避免对放射成像的需要。(2)开发ct生成KLG (ctKLG)并评估其测试-重测可重复性;(3)将该原型ctKLG与放射成像KLG (rKLG)进行比较。方法在堪萨斯大学医学中心招募并同意的14名患者进行了适合分析的基线和随访WBCT成像。参与者的人口统计数据为:平均±SD年龄61.3±8.4岁,BMI 30.7±4.3 kg/m2,男女比例8:6。所有扫描均在同一台XFI WBCT扫描仪上进行(Planmed y, Helsinki, Finland),基线和随访的平均±SD间隔为14.9±8.1天。扫描时使用SynaflexerTM设备对膝关节定位进行标准化。成像采集参数为96 kV管电压,51.4 mA管电流,3.5 s曝光时间。采用标准骨算法重建,各向同性体素为0.3 mm,垂直扫描范围为21 cm。所有的扫描在分析前都是匿名的,根据个人和成像出勤率。所有膝关节均由经验丰富的肌肉骨骼放射科医生(T.D.T.)检查其COAKS。分数记录在谷歌Sheets上的基于云的文件中(与谷歌Docs中的特征图集一起),并由自定义MATLAB脚本读取以生成结构热图。计算每个特征的重测重复性加权Kappa (Kw)评分(J = JSW;O = 骨刺;C = 软骨下囊肿;S = 软骨下硬化)在每个隔室(MTF = 内侧胫股;LTF = 侧胫股的;PF = 髌股;PTF = 近端胫腓的)。定制的MATLAB脚本应用基于公认的KLG口头定义的决策树来生成每个膝关节的ctklg,包括MTF和LTF室的综合评分,以模拟单视图AP放射条件。另一位经验丰富的肌肉骨骼放射科医生(Z.A.)同样盲读了rKLG的研究包含x线片。还计算了ctKLG和rKLG的Kw。图1显示了ctKLGs为1(研究ID 117,右膝)和4(研究ID 101,右膝)的参与者的结构热图以及差异图(随访减去基线)。这些图给出了在极端的结构性疾病中基线和随访分级的最小差异的例子。按特征和区室划分的最佳重复性为MTF区JSW,其Kw (95% CI)为0.94 (0.93-0.96),PF区骨赘的Kw为0.91(0.90-0.93)。除了软骨下硬化症(0.72,0.69-0.74)外,所有隔室的每个特征的重复性接近完美(0.82及以上)。ctKLG的可重复性在所有单独的腔室中都是可观的,并且在胫股联合评分中接近完美(0.83,0.80-0.86)。Kw对于rKLG同样接近完美(0.90,0.88-0.93)。COAKS、原型ctKLG和rKLG的完全可重复性Kw (95% CI)结果见表1和表1;2. 值得注意的是,ctKLG分数对基于口头rKLG解释的决策树的微小变化敏感。结论coaks的可重复性结果与先前对观察者内部评分的结果相似,表明扫描因素足够一致,对读者的表现影响很小。COAKS衍生的ctKLG与rKLG具有相似的可重复性,性能接近完美。ctKLG模型提供了一种不需要x线片就能从WBCT中对结构性疾病进行分层的方法,但需要进一步发展,以建立一个从COAKS中得出的稳健决策树。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Osteoarthritis imaging
Osteoarthritis imaging Radiology and Imaging
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信