Karem Abdul-Samad BSc , Shihao Ma BASc , David E. Austin BCS , Alice Chong BSc , Chloe X. Wang PhD , Xuesong Wang MSc , Peter C. Austin PhD , Heather J. Ross MD MHSc , Bo Wang PhD , Douglas S. Lee MD PhD
{"title":"Comparison of machine learning and conventional statistical modeling for predicting readmission following acute heart failure hospitalization","authors":"Karem Abdul-Samad BSc , Shihao Ma BASc , David E. Austin BCS , Alice Chong BSc , Chloe X. Wang PhD , Xuesong Wang MSc , Peter C. Austin PhD , Heather J. Ross MD MHSc , Bo Wang PhD , Douglas S. Lee MD PhD","doi":"10.1016/j.ahj.2024.07.017","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>Developing accurate models for predicting the risk of 30-day readmission is a major healthcare interest. Evidence suggests that models developed using machine learning (ML) may have better discrimination than conventional statistical models (CSM), but the calibration of such models is unclear.</p></div><div><h3>Objectives</h3><p>To compare models developed using ML with those developed using CSM to predict 30-day readmission for cardiovascular and noncardiovascular causes in HF patients.</p></div><div><h3>Methods</h3><p>We retrospectively enrolled 10,919 patients with HF (> 18 years) discharged alive from a hospital or emergency department (2004-2007) in Ontario, Canada. The study sample was randomly divided into training and validation sets in a 2:1 ratio. CSMs to predict 30-day readmission were developed using Fine-Gray subdistribution hazards regression (treating death as a competing risk), and the ML algorithm employed random survival forests for competing risks (RSF-CR). Models were evaluated in the validation set using both discrimination and calibration metrics.</p></div><div><h3>Results</h3><p>In the validation sample of 3602 patients, RSF-CR (c-statistic=0.620) showed similar discrimination to the Fine-Gray competing risk model (c-statistic=0.621) for 30-day cardiovascular readmission. In contrast, for 30-day noncardiovascular readmission, the Fine-Gray model (c-statistic=0.641) slightly outperformed the RSF-CR model (c-statistic=0.632). For both outcomes, The Fine-Gray model displayed better calibration than RSF-CR using calibration plots of observed vs predicted risks across the deciles of predicted risk.</p></div><div><h3>Conclusions</h3><p>Fine-Gray models had similar discrimination but superior calibration to the RSF-CR model, highlighting the importance of reporting calibration metrics for ML-based prediction models. The discrimination was modest in all readmission prediction models regardless of the methods used.</p></div>","PeriodicalId":7868,"journal":{"name":"American heart journal","volume":"277 ","pages":"Pages 93-103"},"PeriodicalIF":3.7000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American heart journal","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0002870324001832","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Developing accurate models for predicting the risk of 30-day readmission is a major healthcare interest. Evidence suggests that models developed using machine learning (ML) may have better discrimination than conventional statistical models (CSM), but the calibration of such models is unclear.
Objectives
To compare models developed using ML with those developed using CSM to predict 30-day readmission for cardiovascular and noncardiovascular causes in HF patients.
Methods
We retrospectively enrolled 10,919 patients with HF (> 18 years) discharged alive from a hospital or emergency department (2004-2007) in Ontario, Canada. The study sample was randomly divided into training and validation sets in a 2:1 ratio. CSMs to predict 30-day readmission were developed using Fine-Gray subdistribution hazards regression (treating death as a competing risk), and the ML algorithm employed random survival forests for competing risks (RSF-CR). Models were evaluated in the validation set using both discrimination and calibration metrics.
Results
In the validation sample of 3602 patients, RSF-CR (c-statistic=0.620) showed similar discrimination to the Fine-Gray competing risk model (c-statistic=0.621) for 30-day cardiovascular readmission. In contrast, for 30-day noncardiovascular readmission, the Fine-Gray model (c-statistic=0.641) slightly outperformed the RSF-CR model (c-statistic=0.632). For both outcomes, The Fine-Gray model displayed better calibration than RSF-CR using calibration plots of observed vs predicted risks across the deciles of predicted risk.
Conclusions
Fine-Gray models had similar discrimination but superior calibration to the RSF-CR model, highlighting the importance of reporting calibration metrics for ML-based prediction models. The discrimination was modest in all readmission prediction models regardless of the methods used.
期刊介绍:
The American Heart Journal will consider for publication suitable articles on topics pertaining to the broad discipline of cardiovascular disease. Our goal is to provide the reader primary investigation, scholarly review, and opinion concerning the practice of cardiovascular medicine. We especially encourage submission of 3 types of reports that are not frequently seen in cardiovascular journals: negative clinical studies, reports on study designs, and studies involving the organization of medical care. The Journal does not accept individual case reports or original articles involving bench laboratory or animal research.