将机器学习应用于患者报告的结果,对医生衍生的类风湿关节炎疾病活动测量进行分类。

Jeffrey R Curtis, Yujie Su, Shawn Black, Stephen Xu, Wayne Langholff, Clifton O Bingham, Shelly Kafka, Fenglong Xie
{"title":"将机器学习应用于患者报告的结果,对医生衍生的类风湿关节炎疾病活动测量进行分类。","authors":"Jeffrey R Curtis,&nbsp;Yujie Su,&nbsp;Shawn Black,&nbsp;Stephen Xu,&nbsp;Wayne Langholff,&nbsp;Clifton O Bingham,&nbsp;Shelly Kafka,&nbsp;Fenglong Xie","doi":"10.1002/acr2.11499","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>Patient-reported outcome (PRO) data have assumed increasing importance in the care of patients with rheumatoid arthritis (RA), yet physician-derived disease activity measures, such as Clinical Disease Activity Index (CDAI), remain the most accepted metrics to assess disease activity. The possibility that newer longitudinal PRO data might be used as a proxy for the CDAI has not been evaluated.</p><p><strong>Methods: </strong>Using data from a large pragmatic trial, we evaluated patients with RA initiating golimumab intravenous or infliximab. The classification target was low disease activity (LDA) (CDAI ≤10) at the first visit between months 3 and 12. Data were randomly partitioned into training (80%) and test (20%) data sets. Multiple machine learning (ML) methods (eg, random forests, gradient boosting, support vector machines) were used to classify CDAI disease activity category, conduct feature selection, and assess feature importance. Model performance evaluated cross-validated error, comparing different ML approaches using both training and test data.</p><p><strong>Results: </strong>A total of 494 patients were analyzed, and 36.4% achieved LDA. The most important classification features included several Patient-Reported Outcomes Measurement Information System measures (social participation, pain interference, pain intensity, and physical function), patient global, and baseline CDAI. Among all ML methods, random forests performed best. Overall model accuracy and positive predictive values for all ML methods were approximately 80%.</p><p><strong>Conclusion: </strong>ML methods coupled with longitudinal PRO data appear useful and can achieve reasonable accuracy in classifying LDA among patients starting a new biologic. This approach has promise for real-world evidence generation in the common circumstance when physician-derived disease activity data are not available yet PRO measures are.</p>","PeriodicalId":7084,"journal":{"name":"ACR Open Rheumatology","volume":"4 12","pages":"995-1003"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/15/ac/ACR2-4-995.PMC9746663.pdf","citationCount":"2","resultStr":"{\"title\":\"Machine Learning Applied to Patient-Reported Outcomes to Classify Physician-Derived Measures of Rheumatoid Arthritis Disease Activity.\",\"authors\":\"Jeffrey R Curtis,&nbsp;Yujie Su,&nbsp;Shawn Black,&nbsp;Stephen Xu,&nbsp;Wayne Langholff,&nbsp;Clifton O Bingham,&nbsp;Shelly Kafka,&nbsp;Fenglong Xie\",\"doi\":\"10.1002/acr2.11499\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>Patient-reported outcome (PRO) data have assumed increasing importance in the care of patients with rheumatoid arthritis (RA), yet physician-derived disease activity measures, such as Clinical Disease Activity Index (CDAI), remain the most accepted metrics to assess disease activity. The possibility that newer longitudinal PRO data might be used as a proxy for the CDAI has not been evaluated.</p><p><strong>Methods: </strong>Using data from a large pragmatic trial, we evaluated patients with RA initiating golimumab intravenous or infliximab. The classification target was low disease activity (LDA) (CDAI ≤10) at the first visit between months 3 and 12. Data were randomly partitioned into training (80%) and test (20%) data sets. Multiple machine learning (ML) methods (eg, random forests, gradient boosting, support vector machines) were used to classify CDAI disease activity category, conduct feature selection, and assess feature importance. Model performance evaluated cross-validated error, comparing different ML approaches using both training and test data.</p><p><strong>Results: </strong>A total of 494 patients were analyzed, and 36.4% achieved LDA. The most important classification features included several Patient-Reported Outcomes Measurement Information System measures (social participation, pain interference, pain intensity, and physical function), patient global, and baseline CDAI. Among all ML methods, random forests performed best. Overall model accuracy and positive predictive values for all ML methods were approximately 80%.</p><p><strong>Conclusion: </strong>ML methods coupled with longitudinal PRO data appear useful and can achieve reasonable accuracy in classifying LDA among patients starting a new biologic. This approach has promise for real-world evidence generation in the common circumstance when physician-derived disease activity data are not available yet PRO measures are.</p>\",\"PeriodicalId\":7084,\"journal\":{\"name\":\"ACR Open Rheumatology\",\"volume\":\"4 12\",\"pages\":\"995-1003\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/15/ac/ACR2-4-995.PMC9746663.pdf\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ACR Open Rheumatology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1002/acr2.11499\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACR Open Rheumatology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1002/acr2.11499","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

目的:患者报告的预后(PRO)数据在类风湿关节炎(RA)患者的护理中越来越重要,然而医生衍生的疾病活动性测量,如临床疾病活动性指数(CDAI),仍然是最被接受的评估疾病活动性的指标。新的纵向PRO数据可能被用作CDAI的替代指标的可能性尚未得到评估。方法:使用一项大型实用试验的数据,我们评估了开始静脉注射戈利单抗或英夫利昔单抗的RA患者。分类目标为第3 ~ 12个月首次就诊时疾病活动性低(LDA) (CDAI≤10)。数据随机分为训练数据集(80%)和测试数据集(20%)。采用随机森林、梯度增强、支持向量机等多种机器学习方法对CDAI疾病活动类别进行分类、特征选择和特征重要性评估。模型性能评估交叉验证误差,使用训练和测试数据比较不同的ML方法。结果:共分析494例患者,达到LDA的比例为36.4%。最重要的分类特征包括几个患者报告的结果测量信息系统测量(社会参与、疼痛干扰、疼痛强度和身体功能)、患者总体和基线CDAI。在所有的机器学习方法中,随机森林的表现最好。所有ML方法的总体模型准确性和阳性预测值约为80%。结论:ML方法结合纵向PRO数据对开始使用新生物制剂的患者的LDA分类是有用的,并且可以达到合理的准确性。这种方法有望在医生来源的疾病活动数据尚未可用的常见情况下生成真实世界的证据。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Machine Learning Applied to Patient-Reported Outcomes to Classify Physician-Derived Measures of Rheumatoid Arthritis Disease Activity.

Machine Learning Applied to Patient-Reported Outcomes to Classify Physician-Derived Measures of Rheumatoid Arthritis Disease Activity.

Machine Learning Applied to Patient-Reported Outcomes to Classify Physician-Derived Measures of Rheumatoid Arthritis Disease Activity.

Machine Learning Applied to Patient-Reported Outcomes to Classify Physician-Derived Measures of Rheumatoid Arthritis Disease Activity.

Objective: Patient-reported outcome (PRO) data have assumed increasing importance in the care of patients with rheumatoid arthritis (RA), yet physician-derived disease activity measures, such as Clinical Disease Activity Index (CDAI), remain the most accepted metrics to assess disease activity. The possibility that newer longitudinal PRO data might be used as a proxy for the CDAI has not been evaluated.

Methods: Using data from a large pragmatic trial, we evaluated patients with RA initiating golimumab intravenous or infliximab. The classification target was low disease activity (LDA) (CDAI ≤10) at the first visit between months 3 and 12. Data were randomly partitioned into training (80%) and test (20%) data sets. Multiple machine learning (ML) methods (eg, random forests, gradient boosting, support vector machines) were used to classify CDAI disease activity category, conduct feature selection, and assess feature importance. Model performance evaluated cross-validated error, comparing different ML approaches using both training and test data.

Results: A total of 494 patients were analyzed, and 36.4% achieved LDA. The most important classification features included several Patient-Reported Outcomes Measurement Information System measures (social participation, pain interference, pain intensity, and physical function), patient global, and baseline CDAI. Among all ML methods, random forests performed best. Overall model accuracy and positive predictive values for all ML methods were approximately 80%.

Conclusion: ML methods coupled with longitudinal PRO data appear useful and can achieve reasonable accuracy in classifying LDA among patients starting a new biologic. This approach has promise for real-world evidence generation in the common circumstance when physician-derived disease activity data are not available yet PRO measures are.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信