A systematic comparison of short-term and long-term mortality prediction in acute myocardial infarction using machine learning models.

IF 3.3 3区医学 Q2 MEDICAL INFORMATICS

BMC Medical Informatics and Decision Making Pub Date : 2025-06-05 DOI:10.1186/s12911-025-03052-1

Yawei Yang, Junjie Tang, Liping Ma, Feng Wu, Xiaoqing Guan

{"title":"A systematic comparison of short-term and long-term mortality prediction in acute myocardial infarction using machine learning models.","authors":"Yawei Yang, Junjie Tang, Liping Ma, Feng Wu, Xiaoqing Guan","doi":"10.1186/s12911-025-03052-1","DOIUrl":null,"url":null,"abstract":"Background and objective: The machine learning (ML) models for acute myocardial infarction (AMI) are considered to have better predictive ability for mortality compared to conventional risk scoring models. However, previous ML prediction models have mostly been short-term (1 year or less) models. Here, we established ML models for long-term prediction of AMI mortality (5 years or 10 years) and systematically compare the predictive capabilities of short-term models versus long-term models across varying survival time periods.Methods: An observational retrospective study was conducted to analyse mortality prediction in patients with varying survival times. A total of 4,173 patients were enrolled from two different hospitals in China. The dataset was allocated into three groups and an external test set based on their survival duration: the 1-year group (n = 3,626), the 5-year group (n = 2,102), the 10-year group (n = 721), and the external test set (n = 545). A comprehensive set of 53 variables was collected and utilized for model development. Mortality prediction was analysed using oversampling and feature selection methods coupled with machine learning algorithms. SHapley Additive exPlanations (SHAP) values were utilized to quantify the feature importance of AMI risk. The best-performing models from each group were selected for a systematic comparison of predictive accuracy using the external test set with follow-up exceeding 10 years but with varying survival times.Results: For the 1-year model, the RF model achieved the best performance on the test dataset, with an F1 score of 97.81% using only oversampling without feature selection. Conversely, in the case of the 5-years, the combination of LASSO and RF yielded the best performance, achieving F1 scores of 91.35% with both feature selection and oversampling. The best model of 10-years group was SVM with only oversampling without feature selection, yielding an F1 score of 80.7%. Age, BNP, and the Killip classification of AMI were consistently identified as robust predictors across all three groups. This underscores aging as a critical AMI risk factor contributing to mortality. However, despite the model's success, when examining the actual survival times of the 545 patients, of which 64% survived beyond 5 years and 37% beyond 10 years, the 1-year model failed to distinguish between these patients, predicting all as low risk. This highlights the limitation of short-term models, indicating their inability to accurately predict actual long-term survival times despite being commonly used in AMI mortality prediction.Conclusions: The study identifies Age, BNP, and Killip classification as consistent predictors of AMI mortality across all groups, with Age being the most significant factor. CBC parameters and renal biomarkers were pivotal in short-term models, while therapeutic interventions gained prominence over time. The 10-year group emphasised disease severity and treatment history, indicating survivorship bias. Short-term models, typically relying on data spanning 1 year or less, commonly established as predictive models for AMI risk, demonstrate limited capability in accurately predicting actual long-term survival times. To effectively issue early warnings for genuine long-term mortality risks, it is imperative to collect longer-term patient information and establish ML prediction models tailored to long-term outcomes. Further research is warranted to validate these findings in diverse populations.","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"208"},"PeriodicalIF":3.3000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12143097/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03052-1","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}

引用次数: 0

Abstract

Background and objective: The machine learning (ML) models for acute myocardial infarction (AMI) are considered to have better predictive ability for mortality compared to conventional risk scoring models. However, previous ML prediction models have mostly been short-term (1 year or less) models. Here, we established ML models for long-term prediction of AMI mortality (5 years or 10 years) and systematically compare the predictive capabilities of short-term models versus long-term models across varying survival time periods.

Methods: An observational retrospective study was conducted to analyse mortality prediction in patients with varying survival times. A total of 4,173 patients were enrolled from two different hospitals in China. The dataset was allocated into three groups and an external test set based on their survival duration: the 1-year group (n = 3,626), the 5-year group (n = 2,102), the 10-year group (n = 721), and the external test set (n = 545). A comprehensive set of 53 variables was collected and utilized for model development. Mortality prediction was analysed using oversampling and feature selection methods coupled with machine learning algorithms. SHapley Additive exPlanations (SHAP) values were utilized to quantify the feature importance of AMI risk. The best-performing models from each group were selected for a systematic comparison of predictive accuracy using the external test set with follow-up exceeding 10 years but with varying survival times.

Results: For the 1-year model, the RF model achieved the best performance on the test dataset, with an F1 score of 97.81% using only oversampling without feature selection. Conversely, in the case of the 5-years, the combination of LASSO and RF yielded the best performance, achieving F1 scores of 91.35% with both feature selection and oversampling. The best model of 10-years group was SVM with only oversampling without feature selection, yielding an F1 score of 80.7%. Age, BNP, and the Killip classification of AMI were consistently identified as robust predictors across all three groups. This underscores aging as a critical AMI risk factor contributing to mortality. However, despite the model's success, when examining the actual survival times of the 545 patients, of which 64% survived beyond 5 years and 37% beyond 10 years, the 1-year model failed to distinguish between these patients, predicting all as low risk. This highlights the limitation of short-term models, indicating their inability to accurately predict actual long-term survival times despite being commonly used in AMI mortality prediction.

Conclusions: The study identifies Age, BNP, and Killip classification as consistent predictors of AMI mortality across all groups, with Age being the most significant factor. CBC parameters and renal biomarkers were pivotal in short-term models, while therapeutic interventions gained prominence over time. The 10-year group emphasised disease severity and treatment history, indicating survivorship bias. Short-term models, typically relying on data spanning 1 year or less, commonly established as predictive models for AMI risk, demonstrate limited capability in accurately predicting actual long-term survival times. To effectively issue early warnings for genuine long-term mortality risks, it is imperative to collect longer-term patient information and establish ML prediction models tailored to long-term outcomes. Further research is warranted to validate these findings in diverse populations.

查看原文本刊更多论文

使用机器学习模型对急性心肌梗死的短期和长期死亡率预测进行系统比较。

背景与目的：与传统的风险评分模型相比，急性心肌梗死（AMI）的机器学习（ML）模型被认为具有更好的死亡率预测能力。然而，以前的机器学习预测模型大多是短期的（1年或更短）模型。在这里，我们建立了用于AMI死亡率长期预测的ML模型（5年或10年），并系统地比较了短期模型和长期模型在不同生存期的预测能力。方法：采用观察性回顾性研究，分析不同生存期患者的死亡率预测。共有4173名患者从中国两家不同的医院入组。数据集根据患者的生存时间分为三组和一个外部测试集：1年组（n = 3,626）， 5年组（n = 2,102）， 10年组（n = 721）和外部测试集（n = 545）。收集了53个变量的综合集，并将其用于模型开发。使用过采样和特征选择方法结合机器学习算法分析死亡率预测。采用SHapley加性解释（SHAP）值量化AMI风险的特征重要性。从每组中选择表现最好的模型，使用随访超过10年但生存时间不同的外部测试集对预测准确性进行系统比较。结果：对于1年模型，RF模型在测试数据集上的表现最好，仅使用过采样而不进行特征选择的F1得分为97.81%。相反，在5年的情况下，LASSO和RF的组合表现最好，在特征选择和过采样的情况下，F1得分为91.35%。10岁组的最佳模型是只进行过采样而不进行特征选择的SVM， F1得分为80.7%。在所有三组中，年龄、BNP和AMI的Killip分类一致被认为是可靠的预测因素。这强调了衰老是导致死亡的关键AMI危险因素。然而，尽管该模型取得了成功，但在检查545名患者的实际生存时间时（其中64%的患者存活超过5年，37%的患者存活超过10年），1年模型未能区分这些患者，预测所有患者均为低风险。这突出了短期模型的局限性，表明它们无法准确预测实际的长期生存时间，尽管它们通常用于AMI死亡率预测。结论：该研究确定年龄、BNP和Killip分类是所有组AMI死亡率的一致预测因素，年龄是最重要的因素。CBC参数和肾脏生物标志物在短期模型中是关键的，而治疗干预随着时间的推移而得到突出。10年组强调疾病严重程度和治疗史，表明存在生存偏差。短期模型通常依赖于1年或更短时间的数据，通常被建立为AMI风险的预测模型，但在准确预测实际长期生存时间方面能力有限。为了有效地对真正的长期死亡风险发出早期预警，收集长期患者信息并建立适合长期结果的ML预测模型势在必行。需要进一步的研究在不同的人群中验证这些发现。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Informatics and Decision Making 医学-医学：信息

CiteScore

7.20

自引率

5.70%

发文量

297

审稿时长

1 months

期刊介绍： BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.