从美国国立卫生研究院生物标志物联盟进展队列中使用机器学习研究群体选择

Osteoarthritis imaging Pub Date : 2025-01-01 DOI:10.1016/j.ostima.2025.100283

E.B. Dam , J. Collins , F. Eckstein , F.W. Roemer , A. Guermazi , D.J. Hunter

{"title":"从美国国立卫生研究院生物标志物联盟进展队列中使用机器学习研究群体选择","authors":"E.B. Dam , J. Collins , F. Eckstein , F.W. Roemer , A. Guermazi , D.J. Hunter","doi":"10.1016/j.ostima.2025.100283","DOIUrl":null,"url":null,"abstract":"<div><h3>INTRODUCTION</h3><div>Stringent participant selection criteria for DMOAD trials are crucial, but there is no consensus on the criteria currently accepted by the regulatory authorities, particularly not criteria adapted to different treatment targets. Ensuring a population with a higher probability of treatment-specific OA progression may facilitate cost-effective trials with less risk of failure.</div></div><div><h3>OBJECTIVE</h3><div>To investigate whether simple Machine Learning (ML) methods provide more effective and/or transparent selection criteria than conventional statistical methods.</div></div><div><h3>METHODS</h3><div>We investigated the FNIH Biomarkers Consortium cohorts. Phase 1 included 600 subjects from the OAI, as a case/control cohort wrt. OA progression defined by JSW and/or pain progression (JSN decrease ³ 0.7 mm, WOMAC total pain increase ³ 9). Phase 2 included control groups from DMOAD trials (SEKOIA, VIDEO, ILLUSTRATE-K, ROCCELLA), in total 1233 subjects, using the same JSW/Pain endpoints. Consortium members provided biomarker scores for potentially prognostic biomarkers. We focused on the baseline imaging biomarkers submitted for both phases, including semi-quantitative MOAKS readings and quantitative cartilage morphology from MRI. We analyzed the sub-cohorts with complete imaging and clinical biomarkers, i.e. 600 and 366 subjects, respectively. We used a k-nearest neighbor classifier to predict progression as defined by the endpoints selecting a biomarker subset using sequential forward feature selection (SFFS). Model training and validation were performed using 10-fold cross-validation (CV). We measured model performance by the median AUC score across the CV test sets. The performance was compared to a classical logistic regression model with elastic-net regularization, which was trained and scored using the same SFFS and CV.</div></div><div><h3>RESULTS</h3><div>The AUC scores and selected imaging biomarkers for each progression endpoints are shown in Table 1 below. In general, compared to the Logistic Regression models, the kNN models performed on par or slightly better (Phase 1: 0.80 vs 0.79 for JSN and 0.66 vs 0.68 for Pain; Phase 2: 0.76 vs 0.77 for JSN and 0.82 vs 0.73 for Pain).</div></div><div><h3>CONCLUSION</h3><div>The ML model possibly performed slightly better than classical logistic regression. However, it should be analyzed whether the two models include the same biomarkers and what the implications are for the required study sample size. Further, the feature selection step is very relevant for clinical trial design, to ensure a limited yet predictive set of features.</div></div>","PeriodicalId":74378,"journal":{"name":"Osteoarthritis imaging","volume":"5 ","pages":"Article 100283"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"STUDY POPULATION SELECTION USING MACHINE LEARNING FROM THE FNIH BIOMARKERS CONSORTIUM PROGRESS OA COHORT\",\"authors\":\"E.B. Dam , J. Collins , F. Eckstein , F.W. Roemer , A. Guermazi , D.J. Hunter\",\"doi\":\"10.1016/j.ostima.2025.100283\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>INTRODUCTION</h3><div>Stringent participant selection criteria for DMOAD trials are crucial, but there is no consensus on the criteria currently accepted by the regulatory authorities, particularly not criteria adapted to different treatment targets. Ensuring a population with a higher probability of treatment-specific OA progression may facilitate cost-effective trials with less risk of failure.</div></div><div><h3>OBJECTIVE</h3><div>To investigate whether simple Machine Learning (ML) methods provide more effective and/or transparent selection criteria than conventional statistical methods.</div></div><div><h3>METHODS</h3><div>We investigated the FNIH Biomarkers Consortium cohorts. Phase 1 included 600 subjects from the OAI, as a case/control cohort wrt. OA progression defined by JSW and/or pain progression (JSN decrease ³ 0.7 mm, WOMAC total pain increase ³ 9). Phase 2 included control groups from DMOAD trials (SEKOIA, VIDEO, ILLUSTRATE-K, ROCCELLA), in total 1233 subjects, using the same JSW/Pain endpoints. Consortium members provided biomarker scores for potentially prognostic biomarkers. We focused on the baseline imaging biomarkers submitted for both phases, including semi-quantitative MOAKS readings and quantitative cartilage morphology from MRI. We analyzed the sub-cohorts with complete imaging and clinical biomarkers, i.e. 600 and 366 subjects, respectively. We used a k-nearest neighbor classifier to predict progression as defined by the endpoints selecting a biomarker subset using sequential forward feature selection (SFFS). Model training and validation were performed using 10-fold cross-validation (CV). We measured model performance by the median AUC score across the CV test sets. The performance was compared to a classical logistic regression model with elastic-net regularization, which was trained and scored using the same SFFS and CV.</div></div><div><h3>RESULTS</h3><div>The AUC scores and selected imaging biomarkers for each progression endpoints are shown in Table 1 below. In general, compared to the Logistic Regression models, the kNN models performed on par or slightly better (Phase 1: 0.80 vs 0.79 for JSN and 0.66 vs 0.68 for Pain; Phase 2: 0.76 vs 0.77 for JSN and 0.82 vs 0.73 for Pain).</div></div><div><h3>CONCLUSION</h3><div>The ML model possibly performed slightly better than classical logistic regression. However, it should be analyzed whether the two models include the same biomarkers and what the implications are for the required study sample size. Further, the feature selection step is very relevant for clinical trial design, to ensure a limited yet predictive set of features.</div></div>\",\"PeriodicalId\":74378,\"journal\":{\"name\":\"Osteoarthritis imaging\",\"volume\":\"5 \",\"pages\":\"Article 100283\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Osteoarthritis imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772654125000236\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Osteoarthritis imaging","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772654125000236","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

严格的DMOAD试验参与者选择标准是至关重要的，但目前监管机构对接受的标准没有共识，特别是不适合不同治疗目标的标准。确保人群有更高的治疗特异性OA进展的可能性，可以促进具有成本效益的试验，降低失败的风险。目的探讨简单机器学习（ML）方法是否比传统统计方法提供更有效和/或透明的选择标准。方法：我们调查了FNIH生物标志物联盟的队列。第一阶段包括来自OAI的600名受试者，作为病例/对照队列wrt。骨性关节炎进展由JSW和/或疼痛进展定义（JSN减少0.7 mm， WOMAC总疼痛增加3.9 mm）。第二阶段包括来自DMOAD试验的对照组（SEKOIA、VIDEO、ILLUSTRATE-K、ROCCELLA），共1233名受试者，使用相同的JSW/Pain终点。联盟成员提供了潜在预后生物标志物的生物标志物评分。我们专注于这两个阶段提交的基线成像生物标志物，包括半定量MOAKS读数和MRI定量软骨形态。我们分析了具有完整影像学和临床生物标志物的亚队列，分别为600名和366名受试者。我们使用k近邻分类器来预测进展，该进展由使用顺序前向特征选择（SFFS）选择生物标志物子集的端点定义。采用10倍交叉验证（CV）进行模型训练和验证。我们通过CV测试集的中位数AUC得分来衡量模型的性能。将其性能与经典的弹性网络正则化逻辑回归模型进行比较，后者使用相同的SFFS和CV进行训练和评分。每个进展终点的AUC评分和选择的成像生物标志物如下表1所示。一般来说，与Logistic回归模型相比，kNN模型的表现相当或略好(第一阶段：JSN的0.80 vs 0.79， Pain的0.66 vs 0.68；第二阶段：JSN为0.76 vs 0.77， Pain为0.82 vs 0.73)。结论ML模型可能略优于经典逻辑回归。然而，应该分析这两种模型是否包含相同的生物标志物，以及对所需研究样本量的影响。此外，特征选择步骤与临床试验设计非常相关，以确保有限但可预测的特征集。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

STUDY POPULATION SELECTION USING MACHINE LEARNING FROM THE FNIH BIOMARKERS CONSORTIUM PROGRESS OA COHORT

INTRODUCTION

Stringent participant selection criteria for DMOAD trials are crucial, but there is no consensus on the criteria currently accepted by the regulatory authorities, particularly not criteria adapted to different treatment targets. Ensuring a population with a higher probability of treatment-specific OA progression may facilitate cost-effective trials with less risk of failure.

OBJECTIVE

To investigate whether simple Machine Learning (ML) methods provide more effective and/or transparent selection criteria than conventional statistical methods.

METHODS

We investigated the FNIH Biomarkers Consortium cohorts. Phase 1 included 600 subjects from the OAI, as a case/control cohort wrt. OA progression defined by JSW and/or pain progression (JSN decrease ³ 0.7 mm, WOMAC total pain increase ³ 9). Phase 2 included control groups from DMOAD trials (SEKOIA, VIDEO, ILLUSTRATE-K, ROCCELLA), in total 1233 subjects, using the same JSW/Pain endpoints. Consortium members provided biomarker scores for potentially prognostic biomarkers. We focused on the baseline imaging biomarkers submitted for both phases, including semi-quantitative MOAKS readings and quantitative cartilage morphology from MRI. We analyzed the sub-cohorts with complete imaging and clinical biomarkers, i.e. 600 and 366 subjects, respectively. We used a k-nearest neighbor classifier to predict progression as defined by the endpoints selecting a biomarker subset using sequential forward feature selection (SFFS). Model training and validation were performed using 10-fold cross-validation (CV). We measured model performance by the median AUC score across the CV test sets. The performance was compared to a classical logistic regression model with elastic-net regularization, which was trained and scored using the same SFFS and CV.

RESULTS

The AUC scores and selected imaging biomarkers for each progression endpoints are shown in Table 1 below. In general, compared to the Logistic Regression models, the kNN models performed on par or slightly better (Phase 1: 0.80 vs 0.79 for JSN and 0.66 vs 0.68 for Pain; Phase 2: 0.76 vs 0.77 for JSN and 0.82 vs 0.73 for Pain).

CONCLUSION

The ML model possibly performed slightly better than classical logistic regression. However, it should be analyzed whether the two models include the same biomarkers and what the implications are for the required study sample size. Further, the feature selection step is very relevant for clinical trial design, to ensure a limited yet predictive set of features.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Osteoarthritis imaging Radiology and Imaging

自引率

0.00%

发文量