用于肾小球疾病临床结果连续预测因子功能形态识别的岭回归

Jeremy Rubin, Laura Mariani, Abigail Smith, Jarcy Zee
{"title":"用于肾小球疾病临床结果连续预测因子功能形态识别的岭回归","authors":"Jeremy Rubin,&nbsp;Laura Mariani,&nbsp;Abigail Smith,&nbsp;Jarcy Zee","doi":"10.1159/000528847","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Penalized regression models can be used to identify and rank risk factors for poor quality of life or other outcomes. They often assume linear covariate associations, but the true associations may be nonlinear. There is no standard, automated method for determining optimal functional forms (shapes of relationships) between predictors and the outcome in high-dimensional data settings.</p><p><strong>Methods: </strong>We propose a novel algorithm, ridge regression for functional form identification of continuous predictors (RIPR) that models each continuous covariate with linear, quadratic, quartile, and cubic spline basis components in a ridge regression model to capture potential nonlinear relationships between continuous predictors and outcomes. We used a simulation study to test the performance of RIPR compared to standard and spline ridge regression models. Then, we applied RIPR to identify top predictors of Patient-Reported Outcomes Measurement Information System (PROMIS) adult global mental and physical health scores using demographic and clinical characteristics among <i>N</i> = 107 glomerular disease patients enrolled in the Nephrotic Syndrome Study Network (NEPTUNE).</p><p><strong>Results: </strong>RIPR resulted in better predictive accuracy than the standard and spline ridge regression methods in 56-80% of simulation repetitions under a variety of data characteristics. When applied to PROMIS scores in NEPTUNE, RIPR resulted in the lowest error for predicting physical scores, and the second-lowest error for mental scores. Further, RIPR identified hemoglobin quartiles as an important predictor of physical health that was missed by the other models.</p><p><strong>Conclusion: </strong>The RIPR algorithm can capture nonlinear functional forms of predictors that are missed by standard ridge regression models. The top predictors of PROMIS scores vary greatly across methods. RIPR should be considered alongside other machine learning models in the prediction of patient-reported outcomes and other continuous outcomes.</p>","PeriodicalId":73177,"journal":{"name":"Glomerular diseases","volume":"3 1","pages":"47-55"},"PeriodicalIF":0.0000,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/df/0d/gdz-0003-0047.PMC10126734.pdf","citationCount":"0","resultStr":"{\"title\":\"Ridge Regression for Functional Form Identification of Continuous Predictors of Clinical Outcomes in Glomerular Disease.\",\"authors\":\"Jeremy Rubin,&nbsp;Laura Mariani,&nbsp;Abigail Smith,&nbsp;Jarcy Zee\",\"doi\":\"10.1159/000528847\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Penalized regression models can be used to identify and rank risk factors for poor quality of life or other outcomes. They often assume linear covariate associations, but the true associations may be nonlinear. There is no standard, automated method for determining optimal functional forms (shapes of relationships) between predictors and the outcome in high-dimensional data settings.</p><p><strong>Methods: </strong>We propose a novel algorithm, ridge regression for functional form identification of continuous predictors (RIPR) that models each continuous covariate with linear, quadratic, quartile, and cubic spline basis components in a ridge regression model to capture potential nonlinear relationships between continuous predictors and outcomes. We used a simulation study to test the performance of RIPR compared to standard and spline ridge regression models. Then, we applied RIPR to identify top predictors of Patient-Reported Outcomes Measurement Information System (PROMIS) adult global mental and physical health scores using demographic and clinical characteristics among <i>N</i> = 107 glomerular disease patients enrolled in the Nephrotic Syndrome Study Network (NEPTUNE).</p><p><strong>Results: </strong>RIPR resulted in better predictive accuracy than the standard and spline ridge regression methods in 56-80% of simulation repetitions under a variety of data characteristics. When applied to PROMIS scores in NEPTUNE, RIPR resulted in the lowest error for predicting physical scores, and the second-lowest error for mental scores. Further, RIPR identified hemoglobin quartiles as an important predictor of physical health that was missed by the other models.</p><p><strong>Conclusion: </strong>The RIPR algorithm can capture nonlinear functional forms of predictors that are missed by standard ridge regression models. The top predictors of PROMIS scores vary greatly across methods. RIPR should be considered alongside other machine learning models in the prediction of patient-reported outcomes and other continuous outcomes.</p>\",\"PeriodicalId\":73177,\"journal\":{\"name\":\"Glomerular diseases\",\"volume\":\"3 1\",\"pages\":\"47-55\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ftp.ncbi.nlm.nih.gov/pub/pmc/oa_pdf/df/0d/gdz-0003-0047.PMC10126734.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Glomerular diseases\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1159/000528847\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Glomerular diseases","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1159/000528847","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

惩罚回归模型可用于识别和排序生活质量差或其他结果的风险因素。它们通常假设线性协变量关联,但真正的关联可能是非线性的。在高维数据设置中,没有标准的、自动化的方法来确定预测因子和结果之间的最佳函数形式(关系的形状)。方法:我们提出了一种新的连续预测函数形式识别的脊回归算法(RIPR),该算法用脊回归模型中的线性、二次、四分位数和三次样条基分量对每个连续协变量进行建模,以捕捉连续预测因子与结果之间潜在的非线性关系。我们使用模拟研究来测试RIPR与标准和样条脊回归模型的性能。然后,我们利用纳入肾病综合征研究网络(NEPTUNE)的N = 107名肾小球疾病患者的人口学和临床特征,应用RIPR识别患者报告结局测量信息系统(PROMIS)成人全球身心健康评分的顶级预测因子。结果:在各种数据特征下,RIPR在56 ~ 80%的模拟重复次数上的预测精度优于标准和样条脊回归方法。当应用于海王星的PROMIS分数时,RIPR在预测身体分数方面的误差最低,在预测心理分数方面的误差第二低。此外,RIPR发现血红蛋白四分位数是其他模型所遗漏的身体健康的重要预测因子。结论:RIPR算法可以捕捉到标准脊回归模型所遗漏的非线性函数形式。PROMIS得分的最高预测因子在不同的方法中差异很大。在预测患者报告的结果和其他连续结果时,应将RIPR与其他机器学习模型一起考虑。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Ridge Regression for Functional Form Identification of Continuous Predictors of Clinical Outcomes in Glomerular Disease.

Ridge Regression for Functional Form Identification of Continuous Predictors of Clinical Outcomes in Glomerular Disease.

Introduction: Penalized regression models can be used to identify and rank risk factors for poor quality of life or other outcomes. They often assume linear covariate associations, but the true associations may be nonlinear. There is no standard, automated method for determining optimal functional forms (shapes of relationships) between predictors and the outcome in high-dimensional data settings.

Methods: We propose a novel algorithm, ridge regression for functional form identification of continuous predictors (RIPR) that models each continuous covariate with linear, quadratic, quartile, and cubic spline basis components in a ridge regression model to capture potential nonlinear relationships between continuous predictors and outcomes. We used a simulation study to test the performance of RIPR compared to standard and spline ridge regression models. Then, we applied RIPR to identify top predictors of Patient-Reported Outcomes Measurement Information System (PROMIS) adult global mental and physical health scores using demographic and clinical characteristics among N = 107 glomerular disease patients enrolled in the Nephrotic Syndrome Study Network (NEPTUNE).

Results: RIPR resulted in better predictive accuracy than the standard and spline ridge regression methods in 56-80% of simulation repetitions under a variety of data characteristics. When applied to PROMIS scores in NEPTUNE, RIPR resulted in the lowest error for predicting physical scores, and the second-lowest error for mental scores. Further, RIPR identified hemoglobin quartiles as an important predictor of physical health that was missed by the other models.

Conclusion: The RIPR algorithm can capture nonlinear functional forms of predictors that are missed by standard ridge regression models. The top predictors of PROMIS scores vary greatly across methods. RIPR should be considered alongside other machine learning models in the prediction of patient-reported outcomes and other continuous outcomes.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信