A. Goldfarb-Rumyantzev, Robert S Brown, N. Dong, G. Sandhu, Parag Vohra, S. Gautam
{"title":"Developing and testing models to predict mortality in the general population","authors":"A. Goldfarb-Rumyantzev, Robert S Brown, N. Dong, G. Sandhu, Parag Vohra, S. Gautam","doi":"10.1080/17538157.2019.1656209","DOIUrl":null,"url":null,"abstract":"ABSTRACT We have previously proposed an approach using information collected from published reports to generate prediction models. The goal of this project was to validate this technique to develop and test various prediction models. A risk indicator (R) is calculated as a linear combination of the hazard ratios for the following predictors: age, male gender, diabetes, albuminuria, and either CKD, CVD or both. We developed a linear and two exponential expressions to predict the probability of the outcome of 2-year mortality and compared to actual outcome in the target dataset from NHANES. The risk indicator demonstrated good performance with area under ROC curve of 0.84. The linear and two exponential expressions generated similar predictions in the lower categories of risk indicator (R ≤ 6). However, in the groups with higher R value, the linear expression tends to predict lower, and the exponential expressions higher, probabilities than the observed outcome. A Combined model which averaged the linear and logistic expressions was shown to approximate the actual outcome data the best. A simple technique (named Woodpecker™) allows derivation functional prediction models and risk stratification tools from reports of clinical outcome studies and their application to new populations by using only summary statistics of the new population.","PeriodicalId":440622,"journal":{"name":"Informatics for Health and Social Care","volume":"82 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics for Health and Social Care","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/17538157.2019.1656209","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
ABSTRACT We have previously proposed an approach using information collected from published reports to generate prediction models. The goal of this project was to validate this technique to develop and test various prediction models. A risk indicator (R) is calculated as a linear combination of the hazard ratios for the following predictors: age, male gender, diabetes, albuminuria, and either CKD, CVD or both. We developed a linear and two exponential expressions to predict the probability of the outcome of 2-year mortality and compared to actual outcome in the target dataset from NHANES. The risk indicator demonstrated good performance with area under ROC curve of 0.84. The linear and two exponential expressions generated similar predictions in the lower categories of risk indicator (R ≤ 6). However, in the groups with higher R value, the linear expression tends to predict lower, and the exponential expressions higher, probabilities than the observed outcome. A Combined model which averaged the linear and logistic expressions was shown to approximate the actual outcome data the best. A simple technique (named Woodpecker™) allows derivation functional prediction models and risk stratification tools from reports of clinical outcome studies and their application to new populations by using only summary statistics of the new population.