通过mri生物标志物聚类分析数据驱动的膝骨关节炎亚群发现

J.E. Collins , L.A. Deveza , D.J. Hunter , V.B.K. Kraus , A. Guermazi , F.W. Roemer , J.N. Katz , T. Neogi , E. Losina
{"title":"通过mri生物标志物聚类分析数据驱动的膝骨关节炎亚群发现","authors":"J.E. Collins ,&nbsp;L.A. Deveza ,&nbsp;D.J. Hunter ,&nbsp;V.B.K. Kraus ,&nbsp;A. Guermazi ,&nbsp;F.W. Roemer ,&nbsp;J.N. Katz ,&nbsp;T. Neogi ,&nbsp;E. Losina","doi":"10.1016/j.ostima.2025.100351","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>Identifying structural morphotypes in knee OA, subgroups defined by anatomical and morphological attributes, may facilitate personalized treatment by aligning specific patterns of joint damage with treatment mechanism of action. Cluster analysis is a type of unsupervised machine learning used to uncover subgroups and may provide insight into structural morphotypes in knee OA.</div></div><div><h3>OBJECTIVE</h3><div>To use cluster analysis to investigate possible subgroups defined by imaging features in a cohort of persons with knee OA.</div></div><div><h3>METHODS</h3><div>We used data from the PROGRESS OA study, the second phase of the FNIH OA Biomarkers Consortium project, which includes data from the placebo arms of several completed RCTs testing various therapeutic interventions for symptomatic knee OA. MRIs were obtained at baseline and read according to the MRI OA Knee Score (MOAKS) by an experienced radiologist. We included MOAKS assessments of BML size, osteophytes, cartilage, Hoffa-synovitis, effusion-synovitis, and meniscus in the clustering algorithms. Raw ordinal MOAKS scores were used in this analysis. We used Partitioning Around Medoids (PAM) for clustering. PAM is similar to K-means, but instead of defining cluster center as the centroid (mean), the medoid is used, making the method more robust to outliers and appropriate for non-Gaussian data. We undertook several approaches to clustering to perform dimension reduction and incorporate correlations between MOAKS scores A: PAM on Gower’s distance; B: PAM on the dissimilarity matrix from Spearman correlation; C: PAM after non-metric multidimensional scaling (NMDS) using Gower distance for dimension reduction. These approaches aimed to uncover patterns orthogonal to disease severity. The number of clusters was selected based on silhouette width and the gap statistic. Silhouette scores 0.25 to 0.5 indicate weak to reasonable fit.</div></div><div><h3>RESULTS</h3><div>356 participants from four RCTs were included, 138 (39%) with KLG 2 radiographs and 218 (61%) with KLG 3. The cohort was 57% female with average age 62 (SD 8). The number of clusters ranged from 2 to 3 depending on method. There was modest to high overlap between clustering solutions from different methods, suggesting some stability of clustering solutions. Average silhouette scores were 0.19, 0.13, 0.40 for methods A, B, and C, suggesting poor to modest fit. This could suggest weak structure, overlapping clusters, or need for additional dimension reduction. Methods A and C had one cluster dominated (&gt;95% KLG 3) by KLG 3 knees (Figure 1). Investigation of MOAKS assessments by cluster for each of three clustering solutions is shown in Table 1. For example, method C suggested 3 clusters. Clusters 1 and 2 are both approximately 55-60% KLG 2. Cluster 1 has more lateral cartilage damage, and higher BML and osteophyte scores, while cluster 2 has more medial cartilage damage and medial meniscal damage. Cluster 3 is 96% KLG 3 and has extensive medial cartilage damage, with 84% with widespread full-thickness damage in the MFTJ.</div></div><div><h3>CONCLUSION</h3><div>While knees can be separated into clusters based on tissue damage assessed by the MOAKS system, both level of disease severity and compartment involvement (medial vs. lateral) play important roles. Silhouette scores suggest the potential for overlapping clusters or the need for additional data reduction. The advanced disease stage common in DMOAD trial populations may limit the ability to identify meaningful structural morphotypes.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100351"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DATA-DRIVEN DISCOVERY OF KNEE OSTEOARTHRITIS SUBGROUPS VIA CLUSTER ANALYSIS OF MRI BIOMARKERS\",\"authors\":\"J.E. Collins ,&nbsp;L.A. Deveza ,&nbsp;D.J. Hunter ,&nbsp;V.B.K. Kraus ,&nbsp;A. Guermazi ,&nbsp;F.W. Roemer ,&nbsp;J.N. Katz ,&nbsp;T. Neogi ,&nbsp;E. Losina\",\"doi\":\"10.1016/j.ostima.2025.100351\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>INTRODUCTION</h3><div>Identifying structural morphotypes in knee OA, subgroups defined by anatomical and morphological attributes, may facilitate personalized treatment by aligning specific patterns of joint damage with treatment mechanism of action. Cluster analysis is a type of unsupervised machine learning used to uncover subgroups and may provide insight into structural morphotypes in knee OA.</div></div><div><h3>OBJECTIVE</h3><div>To use cluster analysis to investigate possible subgroups defined by imaging features in a cohort of persons with knee OA.</div></div><div><h3>METHODS</h3><div>We used data from the PROGRESS OA study, the second phase of the FNIH OA Biomarkers Consortium project, which includes data from the placebo arms of several completed RCTs testing various therapeutic interventions for symptomatic knee OA. MRIs were obtained at baseline and read according to the MRI OA Knee Score (MOAKS) by an experienced radiologist. We included MOAKS assessments of BML size, osteophytes, cartilage, Hoffa-synovitis, effusion-synovitis, and meniscus in the clustering algorithms. Raw ordinal MOAKS scores were used in this analysis. We used Partitioning Around Medoids (PAM) for clustering. PAM is similar to K-means, but instead of defining cluster center as the centroid (mean), the medoid is used, making the method more robust to outliers and appropriate for non-Gaussian data. We undertook several approaches to clustering to perform dimension reduction and incorporate correlations between MOAKS scores A: PAM on Gower’s distance; B: PAM on the dissimilarity matrix from Spearman correlation; C: PAM after non-metric multidimensional scaling (NMDS) using Gower distance for dimension reduction. These approaches aimed to uncover patterns orthogonal to disease severity. The number of clusters was selected based on silhouette width and the gap statistic. Silhouette scores 0.25 to 0.5 indicate weak to reasonable fit.</div></div><div><h3>RESULTS</h3><div>356 participants from four RCTs were included, 138 (39%) with KLG 2 radiographs and 218 (61%) with KLG 3. The cohort was 57% female with average age 62 (SD 8). The number of clusters ranged from 2 to 3 depending on method. There was modest to high overlap between clustering solutions from different methods, suggesting some stability of clustering solutions. Average silhouette scores were 0.19, 0.13, 0.40 for methods A, B, and C, suggesting poor to modest fit. This could suggest weak structure, overlapping clusters, or need for additional dimension reduction. Methods A and C had one cluster dominated (&gt;95% KLG 3) by KLG 3 knees (Figure 1). Investigation of MOAKS assessments by cluster for each of three clustering solutions is shown in Table 1. For example, method C suggested 3 clusters. Clusters 1 and 2 are both approximately 55-60% KLG 2. Cluster 1 has more lateral cartilage damage, and higher BML and osteophyte scores, while cluster 2 has more medial cartilage damage and medial meniscal damage. Cluster 3 is 96% KLG 3 and has extensive medial cartilage damage, with 84% with widespread full-thickness damage in the MFTJ.</div></div><div><h3>CONCLUSION</h3><div>While knees can be separated into clusters based on tissue damage assessed by the MOAKS system, both level of disease severity and compartment involvement (medial vs. lateral) play important roles. Silhouette scores suggest the potential for overlapping clusters or the need for additional data reduction. The advanced disease stage common in DMOAD trial populations may limit the ability to identify meaningful structural morphotypes.</div></div>\",\"PeriodicalId\":74378,\"journal\":{\"name\":\"Osteoarthritis imaging\",\"volume\":\"5 \",\"pages\":\"Article 100351\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Osteoarthritis imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772654125000911\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000911","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

通过解剖和形态学属性定义的亚组,识别膝关节OA的结构形态类型,可以通过将关节损伤的特定模式与治疗作用机制结合起来,从而促进个性化治疗。聚类分析是一种无监督的机器学习,用于发现亚群,并可能提供对膝关节OA结构形态的见解。目的利用聚类分析研究膝关节OA患者队列中由影像学特征定义的可能亚群。方法:我们使用的数据来自FNIH OA生物标志物联盟项目第二阶段的PROGRESS OA研究,其中包括来自几项已完成的随机对照试验的安慰剂组的数据,这些随机对照试验测试了症状性膝关节OA的各种治疗干预措施。在基线时获得MRI,并由经验丰富的放射科医生根据MRI OA膝关节评分(MOAKS)读取。我们在聚类算法中纳入了BML大小、骨赘、软骨、hoffa -滑膜炎、积液-滑膜炎和半月板的MOAKS评估。本分析使用原始序号MOAKS分数。我们使用围绕介质的分区(PAM)进行集群。PAM类似于K-means,但它没有将聚类中心定义为质心(均值),而是使用了中位数,这使得该方法对异常值更具鲁棒性,适用于非高斯数据。我们采用了几种聚类方法来执行降维并合并MOAKS分数A: PAM在高尔距离上的相关性;B:基于Spearman相关的不相似矩阵的PAM;C:非度量多维尺度(NMDS)后的PAM,使用高尔距离进行降维。这些方法旨在揭示与疾病严重程度无关的模式。根据轮廓宽度和间隙统计量选择聚类数量。剪影值在0.25到0.5之间表示较弱到合理的拟合。结果4项随机对照试验共纳入356例受试者,其中klg2片138例(39%),klg3片218例(61%)。该队列57%为女性,平均年龄62岁(SD 8)。根据不同的方法,集群的数量从2到3不等。不同聚类方法的聚类解之间存在中度到高度的重叠,表明聚类解具有一定的稳定性。方法A、B和C的平均剪影评分分别为0.19、0.13和0.40,表明适合度较差至中等。这可能表明结构薄弱,集群重叠,或者需要额外的降维。方法A、C有1个聚类以klg3膝关节为主(95%为klg3膝关节)(图1)。表1显示了对三种聚类解决方案中每一种聚类的MOAKS评估的调查。例如,方法C建议3个集群。集群1和集群2都是大约55-60%的klg2。第1类多为外侧软骨损伤,BML和骨赘评分较高,第2类多为内侧软骨损伤和内侧半月板损伤。第3组96%为klg3,有广泛的内侧软骨损伤,84%在MFTJ有广泛的全层损伤。结论基于MOAKS系统评估的组织损伤可以将膝关节分成簇状,但疾病严重程度和筋膜室受累程度(内侧与外侧)起重要作用。剪影分数表明可能存在重叠的聚类或需要额外的数据缩减。在DMOAD试验人群中常见的疾病晚期可能限制了识别有意义的结构形态的能力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
DATA-DRIVEN DISCOVERY OF KNEE OSTEOARTHRITIS SUBGROUPS VIA CLUSTER ANALYSIS OF MRI BIOMARKERS

INTRODUCTION

Identifying structural morphotypes in knee OA, subgroups defined by anatomical and morphological attributes, may facilitate personalized treatment by aligning specific patterns of joint damage with treatment mechanism of action. Cluster analysis is a type of unsupervised machine learning used to uncover subgroups and may provide insight into structural morphotypes in knee OA.

OBJECTIVE

To use cluster analysis to investigate possible subgroups defined by imaging features in a cohort of persons with knee OA.

METHODS

We used data from the PROGRESS OA study, the second phase of the FNIH OA Biomarkers Consortium project, which includes data from the placebo arms of several completed RCTs testing various therapeutic interventions for symptomatic knee OA. MRIs were obtained at baseline and read according to the MRI OA Knee Score (MOAKS) by an experienced radiologist. We included MOAKS assessments of BML size, osteophytes, cartilage, Hoffa-synovitis, effusion-synovitis, and meniscus in the clustering algorithms. Raw ordinal MOAKS scores were used in this analysis. We used Partitioning Around Medoids (PAM) for clustering. PAM is similar to K-means, but instead of defining cluster center as the centroid (mean), the medoid is used, making the method more robust to outliers and appropriate for non-Gaussian data. We undertook several approaches to clustering to perform dimension reduction and incorporate correlations between MOAKS scores A: PAM on Gower’s distance; B: PAM on the dissimilarity matrix from Spearman correlation; C: PAM after non-metric multidimensional scaling (NMDS) using Gower distance for dimension reduction. These approaches aimed to uncover patterns orthogonal to disease severity. The number of clusters was selected based on silhouette width and the gap statistic. Silhouette scores 0.25 to 0.5 indicate weak to reasonable fit.

RESULTS

356 participants from four RCTs were included, 138 (39%) with KLG 2 radiographs and 218 (61%) with KLG 3. The cohort was 57% female with average age 62 (SD 8). The number of clusters ranged from 2 to 3 depending on method. There was modest to high overlap between clustering solutions from different methods, suggesting some stability of clustering solutions. Average silhouette scores were 0.19, 0.13, 0.40 for methods A, B, and C, suggesting poor to modest fit. This could suggest weak structure, overlapping clusters, or need for additional dimension reduction. Methods A and C had one cluster dominated (>95% KLG 3) by KLG 3 knees (Figure 1). Investigation of MOAKS assessments by cluster for each of three clustering solutions is shown in Table 1. For example, method C suggested 3 clusters. Clusters 1 and 2 are both approximately 55-60% KLG 2. Cluster 1 has more lateral cartilage damage, and higher BML and osteophyte scores, while cluster 2 has more medial cartilage damage and medial meniscal damage. Cluster 3 is 96% KLG 3 and has extensive medial cartilage damage, with 84% with widespread full-thickness damage in the MFTJ.

CONCLUSION

While knees can be separated into clusters based on tissue damage assessed by the MOAKS system, both level of disease severity and compartment involvement (medial vs. lateral) play important roles. Silhouette scores suggest the potential for overlapping clusters or the need for additional data reduction. The advanced disease stage common in DMOAD trial populations may limit the ability to identify meaningful structural morphotypes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Osteoarthritis imaging
Osteoarthritis imaging Radiology and Imaging
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信