Developing clinical prediction models: a step-by-step guide

The BMJ Pub Date : 2024-09-03 DOI:10.1136/bmj-2023-078276

Orestis Efthimiou, Michael Seo, Konstantina Chalkou, Thomas Debray, Matthias Egger, Georgia Salanti

{"title":"Developing clinical prediction models: a step-by-step guide","authors":"Orestis Efthimiou, Michael Seo, Konstantina Chalkou, Thomas Debray, Matthias Egger, Georgia Salanti","doi":"10.1136/bmj-2023-078276","DOIUrl":null,"url":null,"abstract":"Predicting future outcomes of patients is essential to clinical practice, with many prediction models published each year. Empirical evidence suggests that published studies often have severe methodological limitations, which undermine their usefulness. This article presents a step-by-step guide to help researchers develop and evaluate a clinical prediction model. The guide covers best practices in defining the aim and users, selecting data sources, addressing missing data, exploring alternative modelling options, and assessing model performance. The steps are illustrated using an example from relapsing-remitting multiple sclerosis. Comprehensive R code is also provided. Clinical prediction models aim to forecast future health outcomes given a set of baseline predictors to facilitate medical decision making and improve people’s health outcomes.1 Prediction models are becoming increasingly popular, with many new ones published each year. For example, a review of prediction models identified 263 prediction models in obstetrics alone2; another review found 606 models related to covid-19.3 Interest in predicting health outcomes has been heightened by the increasing availability of big data,4 which has also led to the uptake of machine learning methods for prognostic research in medicine.56 Several resources are available to support prognostic research. The PROGRESS (prognosis research strategy) framework provides detailed guidance on different types of prognostic research.789 The TRIPOD (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis) statement gives recommendations for reporting and has recently been extended to address prediction model research in clustered datasets.1011121314 PROBAST (prediction model risk-of-bias assessment tool) provides a structured way to assess the risk of bias in a prediction modelling study.15 Several papers further outline good practices and provide software code.161718 Despite these resources, published prediction modelling studies often have severe methodological limitations. For instance, …","PeriodicalId":22388,"journal":{"name":"The BMJ","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The BMJ","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1136/bmj-2023-078276","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Predicting future outcomes of patients is essential to clinical practice, with many prediction models published each year. Empirical evidence suggests that published studies often have severe methodological limitations, which undermine their usefulness. This article presents a step-by-step guide to help researchers develop and evaluate a clinical prediction model. The guide covers best practices in defining the aim and users, selecting data sources, addressing missing data, exploring alternative modelling options, and assessing model performance. The steps are illustrated using an example from relapsing-remitting multiple sclerosis. Comprehensive R code is also provided. Clinical prediction models aim to forecast future health outcomes given a set of baseline predictors to facilitate medical decision making and improve people’s health outcomes.1 Prediction models are becoming increasingly popular, with many new ones published each year. For example, a review of prediction models identified 263 prediction models in obstetrics alone2; another review found 606 models related to covid-19.3 Interest in predicting health outcomes has been heightened by the increasing availability of big data,4 which has also led to the uptake of machine learning methods for prognostic research in medicine.56 Several resources are available to support prognostic research. The PROGRESS (prognosis research strategy) framework provides detailed guidance on different types of prognostic research.789 The TRIPOD (transparent reporting of a multivariable prediction model for individual prognosis or diagnosis) statement gives recommendations for reporting and has recently been extended to address prediction model research in clustered datasets.1011121314 PROBAST (prediction model risk-of-bias assessment tool) provides a structured way to assess the risk of bias in a prediction modelling study.15 Several papers further outline good practices and provide software code.161718 Despite these resources, published prediction modelling studies often have severe methodological limitations. For instance, …

查看原文本刊更多论文

开发临床预测模型：分步指南

预测患者的未来预后对临床实践至关重要，每年都有许多预测模型发表。经验证据表明，已发表的研究往往存在严重的方法论局限性，从而削弱了其实用性。本文提供了一份循序渐进的指南，帮助研究人员开发和评估临床预测模型。该指南涵盖了定义目标和用户、选择数据源、处理缺失数据、探索其他建模方案以及评估模型性能等方面的最佳实践。以复发缓解型多发性硬化症为例对这些步骤进行了说明。此外还提供了全面的 R 代码。临床预测模型旨在根据一组基线预测因子预测未来的健康结果，以促进医疗决策并改善人们的健康状况1。例如，一项关于预测模型的综述发现，仅产科就有 263 个预测模型2 ；另一项综述发现，与 covid-19 相关的模型有 606 个。3 人们对预测健康结果的兴趣因大数据4 的日益普及而提高，这也导致了机器学习方法在医学预后研究中的应用。PROGRESS（预后研究策略）框架为不同类型的预后研究提供了详细指导。789 TRIPOD（针对个体预后或诊断的多变量预测模型的透明报告）声明为报告提供了建议，并在近期扩展到聚类数据集中的预测模型研究。PROBAST（预测模型偏倚风险评估工具）为评估预测建模研究中的偏倚风险提供了一种结构化的方法。例如，...

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

The BMJ

自引率

0.00%

发文量

文献相关原料

公司名称	产品信息	采购帮参考价格