Glen P Martin, Richard D Riley, Joie Ensor, Stuart W Grant
{"title":"Statistical primer: sample size considerations for developing and validating clinical prediction models.","authors":"Glen P Martin, Richard D Riley, Joie Ensor, Stuart W Grant","doi":"10.1093/ejcts/ezaf142","DOIUrl":null,"url":null,"abstract":"<p><p>Clinical prediction models are statistical models or machine learning algorithms that combine information on a set of predictor variables about an individual to estimate their risk of a given clinical outcome. It is crucial to ensure that the sample size of the data used to develop or validate a clinical prediction model is large enough. If the data are inadequate, developed models can be unstable and estimates of predictive performance imprecise. This can lead to models that are unfit or even harmful for clinical practice. Recently, there have been a series of sample size formulae developed to estimate the minimum required sample size for prediction model development or external validation. The aim of this statistical primer is to provide an overview of these criteria, describe what information is required to make the calculations and illustrate their implementation through worked examples. The software that is available to implement the sample size criteria is reviewed, and code is provided for all the worked examples.</p>","PeriodicalId":11938,"journal":{"name":"European Journal of Cardio-Thoracic Surgery","volume":" ","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12106283/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Cardio-Thoracic Surgery","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/ejcts/ezaf142","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Clinical prediction models are statistical models or machine learning algorithms that combine information on a set of predictor variables about an individual to estimate their risk of a given clinical outcome. It is crucial to ensure that the sample size of the data used to develop or validate a clinical prediction model is large enough. If the data are inadequate, developed models can be unstable and estimates of predictive performance imprecise. This can lead to models that are unfit or even harmful for clinical practice. Recently, there have been a series of sample size formulae developed to estimate the minimum required sample size for prediction model development or external validation. The aim of this statistical primer is to provide an overview of these criteria, describe what information is required to make the calculations and illustrate their implementation through worked examples. The software that is available to implement the sample size criteria is reviewed, and code is provided for all the worked examples.
期刊介绍:
The primary aim of the European Journal of Cardio-Thoracic Surgery is to provide a medium for the publication of high-quality original scientific reports documenting progress in cardiac and thoracic surgery. The journal publishes reports of significant clinical and experimental advances related to surgery of the heart, the great vessels and the chest. The European Journal of Cardio-Thoracic Surgery is an international journal and accepts submissions from all regions. The journal is supported by a number of leading European societies.