Evaluation of changes in prediction modelling in biomedicine using systematic reviews.

IF 3.9 3区医学 Q1 HEALTH CARE SCIENCES & SERVICES

BMC Medical Research Methodology Pub Date : 2025-07-01 DOI:10.1186/s12874-025-02605-2

Lara Lusa, Franziska Kappenberg, Gary S Collins, Matthias Schmid, Willi Sauerbrei, Jörg Rahnenführer

{"title":"Evaluation of changes in prediction modelling in biomedicine using systematic reviews.","authors":"Lara Lusa, Franziska Kappenberg, Gary S Collins, Matthias Schmid, Willi Sauerbrei, Jörg Rahnenführer","doi":"10.1186/s12874-025-02605-2","DOIUrl":null,"url":null,"abstract":"<p><p>The number of prediction models proposed in the biomedical literature has been growing year on year. In the last few years there has been an increasing attention to the changes occurring in the prediction modeling landscape. It is suggested that machine learning techniques are becoming more popular to develop prediction models to exploit complex data structures, higher-dimensional predictor spaces, very large number of participants, heterogeneous subgroups, with the ability to capture higher-order interactions. We examine the changes in modelling practices by investigating a selection of systematic reviews on prediction models published in the biomedical literature. We selected systematic reviews published between 2020 and 2022 which included at least 50 prediction models. Information was extracted guided by the CHARMS checklist. Time trends were explored using the models published since 2005. We identified 8 reviews, which included 1448 prediction models published in 887 papers. The average number of study participants and outcome events increased considerably between 2015 and 2019 but remained stable afterwards. The number of candidate and final predictors did not noticeably increase over the study period, with a few recent studies using very large numbers of predictors. Internal validation and reporting of discrimination measures became more common, but assessing calibration and carrying out external validation were less common. Information about missing values was not reported in about half of the papers, however the use of imputation methods increased. There was no sign of an increase in using of machine learning methods. Overall, most of the findings were heterogeneous across reviews. Our findings indicate that changes in the prediction modeling landscape in biomedicine are smaller than expected and that poor reporting is still common; adherence to well established best practice recommendations from the traditional biostatistics literature is still needed. For machine learning best practice recommendations are still missing, whereas such recommendations are available in the traditional biostatistics literature, but adherence is still inadequate.</p>","PeriodicalId":9114,"journal":{"name":"BMC Medical Research Methodology","volume":"25 1","pages":"167"},"PeriodicalIF":3.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12211957/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Research Methodology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12874-025-02605-2","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

Abstract

The number of prediction models proposed in the biomedical literature has been growing year on year. In the last few years there has been an increasing attention to the changes occurring in the prediction modeling landscape. It is suggested that machine learning techniques are becoming more popular to develop prediction models to exploit complex data structures, higher-dimensional predictor spaces, very large number of participants, heterogeneous subgroups, with the ability to capture higher-order interactions. We examine the changes in modelling practices by investigating a selection of systematic reviews on prediction models published in the biomedical literature. We selected systematic reviews published between 2020 and 2022 which included at least 50 prediction models. Information was extracted guided by the CHARMS checklist. Time trends were explored using the models published since 2005. We identified 8 reviews, which included 1448 prediction models published in 887 papers. The average number of study participants and outcome events increased considerably between 2015 and 2019 but remained stable afterwards. The number of candidate and final predictors did not noticeably increase over the study period, with a few recent studies using very large numbers of predictors. Internal validation and reporting of discrimination measures became more common, but assessing calibration and carrying out external validation were less common. Information about missing values was not reported in about half of the papers, however the use of imputation methods increased. There was no sign of an increase in using of machine learning methods. Overall, most of the findings were heterogeneous across reviews. Our findings indicate that changes in the prediction modeling landscape in biomedicine are smaller than expected and that poor reporting is still common; adherence to well established best practice recommendations from the traditional biostatistics literature is still needed. For machine learning best practice recommendations are still missing, whereas such recommendations are available in the traditional biostatistics literature, but adherence is still inadequate.

查看原文本刊更多论文

用系统评价评价生物医学预测模型的变化。

生物医学文献中提出的预测模型数量逐年增长。在过去的几年里，人们越来越关注预测建模领域发生的变化。有人建议，机器学习技术正在变得越来越流行，以开发预测模型，以利用复杂的数据结构，高维预测空间，非常大的参与者数量，异构子组，并具有捕获高阶交互的能力。我们通过调查在生物医学文献中发表的预测模型的系统综述来检查建模实践的变化。我们选择了在2020年至2022年间发表的系统综述，其中包括至少50个预测模型。在CHARMS检查表的指导下提取信息。使用自2005年以来发布的模型探索了时间趋势。我们确定了8篇综述，其中包括发表在887篇论文中的1448个预测模型。研究参与者和结果事件的平均数量在2015年至2019年期间大幅增加，但之后保持稳定。候选预测因子和最终预测因子的数量在研究期间并没有显著增加，最近的一些研究使用了大量的预测因子。内部验证和歧视措施的报告变得更加普遍，但是评估校准和执行外部验证却不太常见。大约一半的论文没有报道关于缺失值的信息，然而，使用imputation方法增加了。没有迹象表明机器学习方法的使用有所增加。总的来说，大多数研究结果在综述中是异质的。我们的研究结果表明，生物医学预测模型格局的变化比预期的要小，报告不足仍然很普遍；仍然需要遵守传统生物统计学文献中完善的最佳实践建议。对于机器学习的最佳实践建议仍然缺失，而这些建议在传统的生物统计学文献中是可用的，但坚持仍然不足。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Research Methodology 医学-卫生保健

CiteScore

6.50

自引率

2.50%

发文量

298

审稿时长

3-8 weeks

期刊介绍： BMC Medical Research Methodology is an open access journal publishing original peer-reviewed research articles in methodological approaches to healthcare research. Articles on the methodology of epidemiological research, clinical trials and meta-analysis/systematic review are particularly encouraged, as are empirical studies of the associations between choice of methodology and study outcomes. BMC Medical Research Methodology does not aim to publish articles describing scientific methods or techniques: these should be directed to the BMC journal covering the relevant biomedical subject area.