Carine Savalli , André Henrique Alves Carneiro , Fabiano Barcellos Filho , Murilo Afonso Robiati Bigoto , Roberta Moreira Wichmann , Alexandre Dias Porto Chiavegatto Filho , on behalf of IACOV-BR Network
{"title":"Transfer learning for COVID-19 predictive modeling: A multicenter study of 12 hospitals","authors":"Carine Savalli , André Henrique Alves Carneiro , Fabiano Barcellos Filho , Murilo Afonso Robiati Bigoto , Roberta Moreira Wichmann , Alexandre Dias Porto Chiavegatto Filho , on behalf of IACOV-BR Network","doi":"10.1016/j.annepidem.2025.05.013","DOIUrl":null,"url":null,"abstract":"<div><h3>Purpose</h3><div>To use transfer learning to predict Intensive Care Unit admission (ICU) for COVID-19 patients in a multicenter setting.</div></div><div><h3>Methods</h3><div>We trained XGBoost algorithms using demographic and laboratory predictors recorded during hospital admissions across 12 hospitals. Our goal was to identify the best-performing hospital and evaluate its generalization capacity through external validation in the other 11 hospitals. Additionally, this best-performing hospital served as the source domain for transfer learning and was fine-tuned by incorporating new decision trees trained with local data from each of the remaining hospitals.</div></div><div><h3>Results</h3><div>We observed considerable variation in ICU admission outcomes across the 12 hospitals, with local predictive performance (AUC – area under the receiver operating characteristic curve) ranging from 0.6239 to 0.9410. The best-performing model's generalization capacity, evaluated through external validation, resulted in an AUC below 0.7 in 6 hospitals. By applying transfer learning and fine-tuning this best-performing model, we found that its performance stabilized after incorporating 20 new trees trained with data from the target hospitals. This adjustment improved AUC in 9 out of the 11 hospitals, compared to the results from the external validation.</div></div><div><h3>Conclusions</h3><div>The study showed that transfer learning effectively leverages knowledge from the best-performing hospital to initialize and accelerate model training in other hospitals with lower performance or limited sample sizes, thereby enhancing overall adaptability and efficiency.</div></div>","PeriodicalId":50767,"journal":{"name":"Annals of Epidemiology","volume":"108 ","pages":"Pages 1-7"},"PeriodicalIF":3.0000,"publicationDate":"2025-05-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Annals of Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1047279725001127","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose
To use transfer learning to predict Intensive Care Unit admission (ICU) for COVID-19 patients in a multicenter setting.
Methods
We trained XGBoost algorithms using demographic and laboratory predictors recorded during hospital admissions across 12 hospitals. Our goal was to identify the best-performing hospital and evaluate its generalization capacity through external validation in the other 11 hospitals. Additionally, this best-performing hospital served as the source domain for transfer learning and was fine-tuned by incorporating new decision trees trained with local data from each of the remaining hospitals.
Results
We observed considerable variation in ICU admission outcomes across the 12 hospitals, with local predictive performance (AUC – area under the receiver operating characteristic curve) ranging from 0.6239 to 0.9410. The best-performing model's generalization capacity, evaluated through external validation, resulted in an AUC below 0.7 in 6 hospitals. By applying transfer learning and fine-tuning this best-performing model, we found that its performance stabilized after incorporating 20 new trees trained with data from the target hospitals. This adjustment improved AUC in 9 out of the 11 hospitals, compared to the results from the external validation.
Conclusions
The study showed that transfer learning effectively leverages knowledge from the best-performing hospital to initialize and accelerate model training in other hospitals with lower performance or limited sample sizes, thereby enhancing overall adaptability and efficiency.
期刊介绍:
The journal emphasizes the application of epidemiologic methods to issues that affect the distribution and determinants of human illness in diverse contexts. Its primary focus is on chronic and acute conditions of diverse etiologies and of major importance to clinical medicine, public health, and health care delivery.