Developing and testing models to predict mortality in the general population

Informatics for Health and Social Care Pub Date : 2020-04-02 DOI:10.1080/17538157.2019.1656209

A. Goldfarb-Rumyantzev, Robert S Brown, N. Dong, G. Sandhu, Parag Vohra, S. Gautam

{"title":"Developing and testing models to predict mortality in the general population","authors":"A. Goldfarb-Rumyantzev, Robert S Brown, N. Dong, G. Sandhu, Parag Vohra, S. Gautam","doi":"10.1080/17538157.2019.1656209","DOIUrl":null,"url":null,"abstract":"ABSTRACT We have previously proposed an approach using information collected from published reports to generate prediction models. The goal of this project was to validate this technique to develop and test various prediction models. A risk indicator (R) is calculated as a linear combination of the hazard ratios for the following predictors: age, male gender, diabetes, albuminuria, and either CKD, CVD or both. We developed a linear and two exponential expressions to predict the probability of the outcome of 2-year mortality and compared to actual outcome in the target dataset from NHANES. The risk indicator demonstrated good performance with area under ROC curve of 0.84. The linear and two exponential expressions generated similar predictions in the lower categories of risk indicator (R ≤ 6). However, in the groups with higher R value, the linear expression tends to predict lower, and the exponential expressions higher, probabilities than the observed outcome. A Combined model which averaged the linear and logistic expressions was shown to approximate the actual outcome data the best. A simple technique (named Woodpecker™) allows derivation functional prediction models and risk stratification tools from reports of clinical outcome studies and their application to new populations by using only summary statistics of the new population.","PeriodicalId":440622,"journal":{"name":"Informatics for Health and Social Care","volume":"82 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics for Health and Social Care","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17538157.2019.1656209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

ABSTRACT We have previously proposed an approach using information collected from published reports to generate prediction models. The goal of this project was to validate this technique to develop and test various prediction models. A risk indicator (R) is calculated as a linear combination of the hazard ratios for the following predictors: age, male gender, diabetes, albuminuria, and either CKD, CVD or both. We developed a linear and two exponential expressions to predict the probability of the outcome of 2-year mortality and compared to actual outcome in the target dataset from NHANES. The risk indicator demonstrated good performance with area under ROC curve of 0.84. The linear and two exponential expressions generated similar predictions in the lower categories of risk indicator (R ≤ 6). However, in the groups with higher R value, the linear expression tends to predict lower, and the exponential expressions higher, probabilities than the observed outcome. A Combined model which averaged the linear and logistic expressions was shown to approximate the actual outcome data the best. A simple technique (named Woodpecker™) allows derivation functional prediction models and risk stratification tools from reports of clinical outcome studies and their application to new populations by using only summary statistics of the new population.

查看原文本刊更多论文

开发和测试预测一般人群死亡率的模型

我们之前提出了一种方法，利用从已发表的报告中收集的信息来生成预测模型。该项目的目标是验证该技术，以开发和测试各种预测模型。风险指标(R)计算为以下预测因素风险比的线性组合:年龄、男性、糖尿病、蛋白尿、CKD、CVD或两者兼而有之。我们开发了一个线性和两个指数表达式来预测2年死亡率结果的概率，并将其与NHANES目标数据集中的实际结果进行比较。风险指标表现良好，ROC曲线下面积为0.84。在较低的风险指标类别(R≤6)中，线性表达式和两个指数表达式的预测结果相似，但在R值较高的组中，线性表达式的预测概率倾向于较低，指数表达式的预测概率高于观察结果。将线性表达式和逻辑表达式平均的组合模型最接近实际结果数据。一种简单的技术(名为Woodpecker™)允许从临床结果研究报告中推导功能预测模型和风险分层工具，并通过仅使用新人群的汇总统计数据将其应用于新人群。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Informatics for Health and Social Care

自引率

0.00%

发文量