Application of machine learning and deep learning approaches for prediction modeling with time-to-event outcomes in clinical epidemiology. Methods comparison and practical considerations for generalizability and interpretability

IF 3 3区医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH

Annals of Epidemiology Pub Date : 2025-10-14 DOI:10.1016/j.annepidem.2025.10.012

Siona Prasad, Sabina A. Murphy, David A. Morrow, Benjamin S. Scirica, Marc S. Sabatine, David D. Berg, Andrea Bellavia

{"title":"Application of machine learning and deep learning approaches for prediction modeling with time-to-event outcomes in clinical epidemiology. Methods comparison and practical considerations for generalizability and interpretability","authors":"Siona Prasad, Sabina A. Murphy, David A. Morrow, Benjamin S. Scirica, Marc S. Sabatine, David D. Berg, Andrea Bellavia","doi":"10.1016/j.annepidem.2025.10.012","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>Clinical prediction models (CPM) are essential tools for diagnosis and prognosis in clinical epidemiology. Machine learning (ML) and deep learning (DL) approaches provide flexible methods that can complement regression-based methods for CPM when complex predictors such as clinical biomarkers are of interest. However, concerns have been raised on the ability of ML and DL to address desired properties of CPMs such as parsimony, generalizability, and interpretability.</div></div><div><h3>Methods</h3><div>In this study, we evaluated and applied selected regression-based, ML and DL approaches for time-to-event outcomes in a clinical study integrating protein biomarkers and lipids in an existing CPM for cardiovascular risk.</div></div><div><h3>Results</h3><div>We observed considerable advantages from the application of gradient boosting machines (GBM: C-statistic=0.72; Brier Score=0.052), which provided the best balance between model flexibility, discrimination, calibration, and parsimony, the latter being directly related to one of the model parameters (shrinking rate). Further, GBM results can be used for individual risk prediction, providing an interpretable tool for CPM implementation.</div></div><div><h3>Conclusions</h3><div>We compared ML and DL methods for CPM with time-to-event outcomes and discussed practical aspects of their implementation in clinical epidemiology including generalizability and interpretability. Adequately trained ML approaches can provide advantages in prediction modeling, especially when integrating complex predictors.</div></div>","PeriodicalId":50767,"journal":{"name":"Annals of Epidemiology","volume":"111 ","pages":"Pages 186-192"},"PeriodicalIF":3.0000,"publicationDate":"2025-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047279725003096","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose

Clinical prediction models (CPM) are essential tools for diagnosis and prognosis in clinical epidemiology. Machine learning (ML) and deep learning (DL) approaches provide flexible methods that can complement regression-based methods for CPM when complex predictors such as clinical biomarkers are of interest. However, concerns have been raised on the ability of ML and DL to address desired properties of CPMs such as parsimony, generalizability, and interpretability.

Methods

In this study, we evaluated and applied selected regression-based, ML and DL approaches for time-to-event outcomes in a clinical study integrating protein biomarkers and lipids in an existing CPM for cardiovascular risk.

Results

We observed considerable advantages from the application of gradient boosting machines (GBM: C-statistic=0.72; Brier Score=0.052), which provided the best balance between model flexibility, discrimination, calibration, and parsimony, the latter being directly related to one of the model parameters (shrinking rate). Further, GBM results can be used for individual risk prediction, providing an interpretable tool for CPM implementation.

Conclusions

We compared ML and DL methods for CPM with time-to-event outcomes and discussed practical aspects of their implementation in clinical epidemiology including generalizability and interpretability. Adequately trained ML approaches can provide advantages in prediction modeling, especially when integrating complex predictors.

查看原文本刊更多论文

机器学习和深度学习方法在临床流行病学中具有事件时间结果的预测建模中的应用。方法概括性与可解释性的比较与实践思考。

目的：临床预测模型（CPM）是临床流行病学诊断和预后的重要工具。机器学习（ML）和深度学习（DL）方法提供了灵活的方法，当对临床生物标志物等复杂预测因素感兴趣时，可以补充基于回归的CPM方法。然而，人们对ML和DL处理cpm所需属性的能力提出了关注，如简约性、概括性和可解释性。方法：在本研究中，我们在一项临床研究中评估并应用了基于回归的、ML和DL方法来评估事件发生时间的结果，该研究整合了现有CPM中心血管风险的蛋白质生物标志物和脂质。结果：我们观察到梯度增强机的应用具有相当大的优势（GBM: C-statistic=0.72; Brier Score=0.052），它提供了模型灵活性、判别性、校准性和简约性之间的最佳平衡，后者与模型参数之一（收缩率）直接相关。此外，GBM结果可用于个体风险预测，为CPM的实施提供了一个可解释的工具。结论：我们比较了ML和DL方法对CPM的时间-事件结果的影响，并讨论了它们在临床流行病学中实施的实际方面，包括普遍性和可解释性。经过充分训练的机器学习方法可以在预测建模方面提供优势，特别是在集成复杂预测器时。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Annals of Epidemiology 医学-公共卫生、环境卫生与职业卫生

CiteScore

7.40

自引率

1.80%

发文量

207

审稿时长

59 days

期刊介绍： The journal emphasizes the application of epidemiologic methods to issues that affect the distribution and determinants of human illness in diverse contexts. Its primary focus is on chronic and acute conditions of diverse etiologies and of major importance to clinical medicine, public health, and health care delivery.