Finding a constrained number of predictor phenotypes for multiple outcome prediction.

IF 4.1 Q1 HEALTH CARE SCIENCES & SERVICES
Jenna M Reps, Jenna Wong, Egill A Fridgeirsson, Chungsoo Kim, Luis H John, Ross D Williams, Renae R Fisher, Patrick B Ryan
{"title":"Finding a constrained number of predictor phenotypes for multiple outcome prediction.","authors":"Jenna M Reps, Jenna Wong, Egill A Fridgeirsson, Chungsoo Kim, Luis H John, Ross D Williams, Renae R Fisher, Patrick B Ryan","doi":"10.1136/bmjhci-2024-101227","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Prognostic models help aid medical decision-making. Various prognostic models are available via websites such as MDCalc, but these models typically predict one outcome, for example, stroke risk. Each model requires individual predictors, for example, age, lab results and comorbidities. There is no clinical tool available to predict multiple outcomes from a list of common medical predictors.</p><p><strong>Objective: </strong>Identify a constrained set of outcome-agnostic predictors.</p><p><strong>Methods: </strong>We proposed a novel technique aggregating the standardised mean difference across hundreds of outcomes to learn a constrained set of predictors that appear to be predictive of many outcomes. Model performance was evaluated using the constrained set of predictors across eight prediction tasks. We compared against existing models, models using only age/sex predictors and models without any predictor constraints.</p><p><strong>Results: </strong>We identified 67 predictors in our constrained set, plus age/sex. Our predictors included illnesses in the following categories: cardiovascular, kidney/liver, mental health, gastrointestinal, infectious and oncologic. Models developed using the constrained set of predictors achieved comparable discrimination compared with models using hundreds or thousands of predictors for five of the eight prediction tasks and slightly lower discrimination for three of the eight tasks. The constrained predictor models performed as good or better than all existing clinical models.</p><p><strong>Conclusions: </strong>It is possible to develop models for hundreds or thousands of outcomes that use the same small set of predictors. This makes it feasible to implement many prediction models via a single website form. Our set of predictors can also be used for future models and prognostic model research.</p>","PeriodicalId":9050,"journal":{"name":"BMJ Health & Care Informatics","volume":"32 1","pages":""},"PeriodicalIF":4.1000,"publicationDate":"2025-01-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11751890/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMJ Health & Care Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmjhci-2024-101227","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Prognostic models help aid medical decision-making. Various prognostic models are available via websites such as MDCalc, but these models typically predict one outcome, for example, stroke risk. Each model requires individual predictors, for example, age, lab results and comorbidities. There is no clinical tool available to predict multiple outcomes from a list of common medical predictors.

Objective: Identify a constrained set of outcome-agnostic predictors.

Methods: We proposed a novel technique aggregating the standardised mean difference across hundreds of outcomes to learn a constrained set of predictors that appear to be predictive of many outcomes. Model performance was evaluated using the constrained set of predictors across eight prediction tasks. We compared against existing models, models using only age/sex predictors and models without any predictor constraints.

Results: We identified 67 predictors in our constrained set, plus age/sex. Our predictors included illnesses in the following categories: cardiovascular, kidney/liver, mental health, gastrointestinal, infectious and oncologic. Models developed using the constrained set of predictors achieved comparable discrimination compared with models using hundreds or thousands of predictors for five of the eight prediction tasks and slightly lower discrimination for three of the eight tasks. The constrained predictor models performed as good or better than all existing clinical models.

Conclusions: It is possible to develop models for hundreds or thousands of outcomes that use the same small set of predictors. This makes it feasible to implement many prediction models via a single website form. Our set of predictors can also be used for future models and prognostic model research.

为多结果预测寻找有限数量的预测因子表型。
背景:预后模型有助于医疗决策。通过MDCalc等网站可以获得各种预后模型,但这些模型通常预测一种结果,例如中风风险。每个模型都需要单独的预测因素,例如年龄、实验室结果和合并症。目前还没有临床工具可以从常见的医学预测因子列表中预测多种结果。目的:确定一组约束的结果不可知预测因子。方法:我们提出了一种新技术,汇总数百个结果的标准化平均差异,以学习一组约束的预测因子,这些预测因子似乎可以预测许多结果。在八个预测任务中使用约束的预测器集评估模型性能。我们比较了现有的模型,仅使用年龄/性别预测因子的模型和没有任何预测因子约束的模型。结果:我们在约束集中确定了67个预测因子,加上年龄/性别。我们的预测指标包括以下类别的疾病:心血管疾病、肾脏/肝脏疾病、心理健康疾病、胃肠道疾病、传染病和肿瘤疾病。与使用数百或数千个预测因子的模型相比,使用约束预测因子集开发的模型在8个预测任务中的5个任务中实现了相当的歧视,在8个任务中的3个任务中略有降低的歧视。约束预测模型的表现与所有现有的临床模型一样好或更好。结论:有可能为使用相同的一小组预测因子的数百或数千种结果开发模型。这使得通过一个单一的网站表单实现许多预测模型成为可能。我们的预测因子集也可用于未来模型和预测模型研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.10
自引率
4.90%
发文量
40
审稿时长
18 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信