Developing and testing models to predict mortality in the general population

A. Goldfarb-Rumyantzev, Robert S Brown, N. Dong, G. Sandhu, Parag Vohra, S. Gautam
{"title":"Developing and testing models to predict mortality in the general population","authors":"A. Goldfarb-Rumyantzev, Robert S Brown, N. Dong, G. Sandhu, Parag Vohra, S. Gautam","doi":"10.1080/17538157.2019.1656209","DOIUrl":null,"url":null,"abstract":"ABSTRACT We have previously proposed an approach using information collected from published reports to generate prediction models. The goal of this project was to validate this technique to develop and test various prediction models. A risk indicator (R) is calculated as a linear combination of the hazard ratios for the following predictors: age, male gender, diabetes, albuminuria, and either CKD, CVD or both. We developed a linear and two exponential expressions to predict the probability of the outcome of 2-year mortality and compared to actual outcome in the target dataset from NHANES. The risk indicator demonstrated good performance with area under ROC curve of 0.84. The linear and two exponential expressions generated similar predictions in the lower categories of risk indicator (R ≤ 6). However, in the groups with higher R value, the linear expression tends to predict lower, and the exponential expressions higher, probabilities than the observed outcome. A Combined model which averaged the linear and logistic expressions was shown to approximate the actual outcome data the best. A simple technique (named Woodpecker™) allows derivation functional prediction models and risk stratification tools from reports of clinical outcome studies and their application to new populations by using only summary statistics of the new population.","PeriodicalId":440622,"journal":{"name":"Informatics for Health and Social Care","volume":"82 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics for Health and Social Care","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17538157.2019.1656209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

ABSTRACT We have previously proposed an approach using information collected from published reports to generate prediction models. The goal of this project was to validate this technique to develop and test various prediction models. A risk indicator (R) is calculated as a linear combination of the hazard ratios for the following predictors: age, male gender, diabetes, albuminuria, and either CKD, CVD or both. We developed a linear and two exponential expressions to predict the probability of the outcome of 2-year mortality and compared to actual outcome in the target dataset from NHANES. The risk indicator demonstrated good performance with area under ROC curve of 0.84. The linear and two exponential expressions generated similar predictions in the lower categories of risk indicator (R ≤ 6). However, in the groups with higher R value, the linear expression tends to predict lower, and the exponential expressions higher, probabilities than the observed outcome. A Combined model which averaged the linear and logistic expressions was shown to approximate the actual outcome data the best. A simple technique (named Woodpecker™) allows derivation functional prediction models and risk stratification tools from reports of clinical outcome studies and their application to new populations by using only summary statistics of the new population.
开发和测试预测一般人群死亡率的模型
我们之前提出了一种方法,利用从已发表的报告中收集的信息来生成预测模型。该项目的目标是验证该技术,以开发和测试各种预测模型。风险指标(R)计算为以下预测因素风险比的线性组合:年龄、男性、糖尿病、蛋白尿、CKD、CVD或两者兼而有之。我们开发了一个线性和两个指数表达式来预测2年死亡率结果的概率,并将其与NHANES目标数据集中的实际结果进行比较。风险指标表现良好,ROC曲线下面积为0.84。在较低的风险指标类别(R≤6)中,线性表达式和两个指数表达式的预测结果相似,但在R值较高的组中,线性表达式的预测概率倾向于较低,指数表达式的预测概率高于观察结果。将线性表达式和逻辑表达式平均的组合模型最接近实际结果数据。一种简单的技术(名为Woodpecker™)允许从临床结果研究报告中推导功能预测模型和风险分层工具,并通过仅使用新人群的汇总统计数据将其应用于新人群。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信