Comparison of machine learning and conventional statistical modeling for predicting readmission following acute heart failure hospitalization

IF 3.7 2区医学 Q1 CARDIAC & CARDIOVASCULAR SYSTEMS

American heart journal Pub Date : 2024-07-31 DOI:10.1016/j.ahj.2024.07.017

Karem Abdul-Samad BSc , Shihao Ma BASc , David E. Austin BCS , Alice Chong BSc , Chloe X. Wang PhD , Xuesong Wang MSc , Peter C. Austin PhD , Heather J. Ross MD MHSc , Bo Wang PhD , Douglas S. Lee MD PhD

{"title":"Comparison of machine learning and conventional statistical modeling for predicting readmission following acute heart failure hospitalization","authors":"Karem Abdul-Samad BSc , Shihao Ma BASc , David E. Austin BCS , Alice Chong BSc , Chloe X. Wang PhD , Xuesong Wang MSc , Peter C. Austin PhD , Heather J. Ross MD MHSc , Bo Wang PhD , Douglas S. Lee MD PhD","doi":"10.1016/j.ahj.2024.07.017","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>Developing accurate models for predicting the risk of 30-day readmission is a major healthcare interest. Evidence suggests that models developed using machine learning (ML) may have better discrimination than conventional statistical models (CSM), but the calibration of such models is unclear.</p></div><div><h3>Objectives</h3><p>To compare models developed using ML with those developed using CSM to predict 30-day readmission for cardiovascular and noncardiovascular causes in HF patients.</p></div><div><h3>Methods</h3><p>We retrospectively enrolled 10,919 patients with HF (> 18 years) discharged alive from a hospital or emergency department (2004-2007) in Ontario, Canada. The study sample was randomly divided into training and validation sets in a 2:1 ratio. CSMs to predict 30-day readmission were developed using Fine-Gray subdistribution hazards regression (treating death as a competing risk), and the ML algorithm employed random survival forests for competing risks (RSF-CR). Models were evaluated in the validation set using both discrimination and calibration metrics.</p></div><div><h3>Results</h3><p>In the validation sample of 3602 patients, RSF-CR (c-statistic=0.620) showed similar discrimination to the Fine-Gray competing risk model (c-statistic=0.621) for 30-day cardiovascular readmission. In contrast, for 30-day noncardiovascular readmission, the Fine-Gray model (c-statistic=0.641) slightly outperformed the RSF-CR model (c-statistic=0.632). For both outcomes, The Fine-Gray model displayed better calibration than RSF-CR using calibration plots of observed vs predicted risks across the deciles of predicted risk.</p></div><div><h3>Conclusions</h3><p>Fine-Gray models had similar discrimination but superior calibration to the RSF-CR model, highlighting the importance of reporting calibration metrics for ML-based prediction models. The discrimination was modest in all readmission prediction models regardless of the methods used.</p></div>","PeriodicalId":7868,"journal":{"name":"American heart journal","volume":"277 ","pages":"Pages 93-103"},"PeriodicalIF":3.7000,"publicationDate":"2024-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American heart journal","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0002870324001832","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Introduction

Developing accurate models for predicting the risk of 30-day readmission is a major healthcare interest. Evidence suggests that models developed using machine learning (ML) may have better discrimination than conventional statistical models (CSM), but the calibration of such models is unclear.

Objectives

To compare models developed using ML with those developed using CSM to predict 30-day readmission for cardiovascular and noncardiovascular causes in HF patients.

Methods

We retrospectively enrolled 10,919 patients with HF (> 18 years) discharged alive from a hospital or emergency department (2004-2007) in Ontario, Canada. The study sample was randomly divided into training and validation sets in a 2:1 ratio. CSMs to predict 30-day readmission were developed using Fine-Gray subdistribution hazards regression (treating death as a competing risk), and the ML algorithm employed random survival forests for competing risks (RSF-CR). Models were evaluated in the validation set using both discrimination and calibration metrics.

Results

In the validation sample of 3602 patients, RSF-CR (c-statistic=0.620) showed similar discrimination to the Fine-Gray competing risk model (c-statistic=0.621) for 30-day cardiovascular readmission. In contrast, for 30-day noncardiovascular readmission, the Fine-Gray model (c-statistic=0.641) slightly outperformed the RSF-CR model (c-statistic=0.632). For both outcomes, The Fine-Gray model displayed better calibration than RSF-CR using calibration plots of observed vs predicted risks across the deciles of predicted risk.

Conclusions

Fine-Gray models had similar discrimination but superior calibration to the RSF-CR model, highlighting the importance of reporting calibration metrics for ML-based prediction models. The discrimination was modest in all readmission prediction models regardless of the methods used.

Abstract Image

查看原文本刊更多论文

机器学习与传统统计建模在预测急性心力衰竭住院后再入院方面的比较。

简介开发用于预测 30 天再入院风险的精确模型是医疗保健领域的一项重大课题。有证据表明，使用机器学习（ML）开发的模型可能比传统统计模型（CSM）具有更好的识别能力，但此类模型的校准尚不明确：比较使用 ML 和 CSM 开发的模型，以预测高血压患者因心血管和非心血管原因导致的 30 天再入院情况：我们回顾性招募了加拿大安大略省（2004-2007 年）10919 名从医院或急诊科活着出院的高血压患者（18 岁以上）。研究样本按 2:1 的比例随机分为训练集和验证集。预测 30 天再入院的 CSM 采用 Fine-Gray subdistribution hazards 回归（将死亡视为竞争风险），ML 算法采用竞争风险随机生存林 (RSF-CR)。在验证集中使用判别和校准指标对模型进行了评估：在 3602 名患者的验证样本中，RSF-CR（c-统计量=0.620）与 Fine-Gray 竞争风险模型（c-统计量=0.621）对 30 天心血管疾病再入院的区分度相似。相反，对于 30 天非心血管疾病再入院，Fine-Gray 模型（c-统计量=0.641）略优于 RSF-CR 模型（c-统计量=0.632）。对于这两种结果，Fine-Gray 模型的校准效果优于 RSF-CR：Fine-Gray模型与RSF-CR模型具有相似的区分度，但校准效果优于RSF-CR模型，这突出了报告基于ML的预测模型校准指标的重要性。无论使用哪种方法，所有再入院预测模型的区分度都不高。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

American heart journal 医学-心血管系统

CiteScore

8.20

自引率

2.10%

发文量

214

审稿时长

38 days

期刊介绍： The American Heart Journal will consider for publication suitable articles on topics pertaining to the broad discipline of cardiovascular disease. Our goal is to provide the reader primary investigation, scholarly review, and opinion concerning the practice of cardiovascular medicine. We especially encourage submission of 3 types of reports that are not frequently seen in cardiovascular journals: negative clinical studies, reports on study designs, and studies involving the organization of medical care. The Journal does not accept individual case reports or original articles involving bench laboratory or animal research.