Developing a multi-label learning model to predict major adverse cardiovascular events in patients with unstable angina pectoris: A prospective cohort study
IF 4.1 2区 医学Q2 COMPUTER SCIENCE, INFORMATION SYSTEMS
Jing Li , Hong Yang , Yu Zhang , Jingjing Yan , Jing Tian , Yanbo Zhang
{"title":"Developing a multi-label learning model to predict major adverse cardiovascular events in patients with unstable angina pectoris: A prospective cohort study","authors":"Jing Li , Hong Yang , Yu Zhang , Jingjing Yan , Jing Tian , Yanbo Zhang","doi":"10.1016/j.ijmedinf.2025.106037","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Major adverse cardiovascular events (MACE) represent critical endpoints in cardiovascular research. The occurrence of MACE in patients with unstable angina pectoris (UAP) exhibits multidimensional complexity. We employed multi-label learning (MLL) models to concurrently predict five distinct types of MACE.</div></div><div><h3>Methods</h3><div>This prospective observational cohort study analysed the 978 UAP patients from the Second Affiliated Hospital of Shanxi Medical University (Taiyuan, China) between July 1, 2017, and June 30, 2019. Three-year follow-up endpoints encompassed all-cause death, heart failure, stroke, myocardial infarction, and revascularization. We utilized ReliefF for Multi-label Feature Selection (RFML), Mutual Information-based Feature Selection (MIFS), and Scalable Criteria for Large label Set (SCLS) to identify significant prognostic variables. Nineteen MLL models were implemented, including Binary Relevance (BR), Classifier Chains (CC), Label Powerset (LP), Random k-Labelsets (RAkEL), Multi-label k-Nearest Neighbor, Twin Support Vector Machine to Multi-label Learning, and Wrapping multi-label learning with label-specific features generation. BR, CC, LP, and RAkEL models were constructed using four base classifiers: Decision Tree, Random Forest, Extreme Gradient Boosting, and Light Gradient Boosting Machine. Performance evaluation incorporated 12 different metrics.</div></div><div><h3>Results</h3><div>The RFML, MIFS, and SCLS respectively screened 18, 12, and 14 important features, and the MLL prediction performance based on RFML selected features was the best. Among the MLL models, RAkEL with Random Forest as the base classifier demonstrated superior predictive performance, achieving an Accuracy of 0.575 <span><math><mrow><mo>±</mo></mrow></math></span> 0.022, Precision of 0.646 <span><math><mrow><mo>±</mo></mrow></math></span> 0.029, Hamming loss of 0.159 <span><math><mrow><mo>±</mo></mrow></math></span> 0.008, One error of 0.425 <span><math><mrow><mo>±</mo></mrow></math></span> 0.022, Macro_F1 of 0.719 <span><math><mrow><mo>±</mo></mrow></math></span> 0.028, Micro_F1 of 0.740 <span><math><mrow><mo>±</mo></mrow></math></span> 0.011, Macro_AUC of 0.786 <span><math><mrow><mo>±</mo></mrow></math></span> 0.031, Micro_AUC of 0.806 <span><math><mrow><mo>±</mo></mrow></math></span> 0.030 and Multi-Brier Score of 0.115 <span><math><mrow><mo>±</mo></mrow></math></span> 0.035.</div></div><div><h3>Conclusions</h3><div>The RAkEL model with Random Forest as the base classifier significantly enhanced predictive accuracy for MACE in UAP patients. This approach provides a more comprehensive risk assessment, enabling clinicians to develop personalized treatment strategies and improve patient outcomes.</div></div>","PeriodicalId":54950,"journal":{"name":"International Journal of Medical Informatics","volume":"203 ","pages":"Article 106037"},"PeriodicalIF":4.1000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1386505625002540","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background
Major adverse cardiovascular events (MACE) represent critical endpoints in cardiovascular research. The occurrence of MACE in patients with unstable angina pectoris (UAP) exhibits multidimensional complexity. We employed multi-label learning (MLL) models to concurrently predict five distinct types of MACE.
Methods
This prospective observational cohort study analysed the 978 UAP patients from the Second Affiliated Hospital of Shanxi Medical University (Taiyuan, China) between July 1, 2017, and June 30, 2019. Three-year follow-up endpoints encompassed all-cause death, heart failure, stroke, myocardial infarction, and revascularization. We utilized ReliefF for Multi-label Feature Selection (RFML), Mutual Information-based Feature Selection (MIFS), and Scalable Criteria for Large label Set (SCLS) to identify significant prognostic variables. Nineteen MLL models were implemented, including Binary Relevance (BR), Classifier Chains (CC), Label Powerset (LP), Random k-Labelsets (RAkEL), Multi-label k-Nearest Neighbor, Twin Support Vector Machine to Multi-label Learning, and Wrapping multi-label learning with label-specific features generation. BR, CC, LP, and RAkEL models were constructed using four base classifiers: Decision Tree, Random Forest, Extreme Gradient Boosting, and Light Gradient Boosting Machine. Performance evaluation incorporated 12 different metrics.
Results
The RFML, MIFS, and SCLS respectively screened 18, 12, and 14 important features, and the MLL prediction performance based on RFML selected features was the best. Among the MLL models, RAkEL with Random Forest as the base classifier demonstrated superior predictive performance, achieving an Accuracy of 0.575 0.022, Precision of 0.646 0.029, Hamming loss of 0.159 0.008, One error of 0.425 0.022, Macro_F1 of 0.719 0.028, Micro_F1 of 0.740 0.011, Macro_AUC of 0.786 0.031, Micro_AUC of 0.806 0.030 and Multi-Brier Score of 0.115 0.035.
Conclusions
The RAkEL model with Random Forest as the base classifier significantly enhanced predictive accuracy for MACE in UAP patients. This approach provides a more comprehensive risk assessment, enabling clinicians to develop personalized treatment strategies and improve patient outcomes.
期刊介绍:
International Journal of Medical Informatics provides an international medium for dissemination of original results and interpretative reviews concerning the field of medical informatics. The Journal emphasizes the evaluation of systems in healthcare settings.
The scope of journal covers:
Information systems, including national or international registration systems, hospital information systems, departmental and/or physician''s office systems, document handling systems, electronic medical record systems, standardization, systems integration etc.;
Computer-aided medical decision support systems using heuristic, algorithmic and/or statistical methods as exemplified in decision theory, protocol development, artificial intelligence, etc.
Educational computer based programs pertaining to medical informatics or medicine in general;
Organizational, economic, social, clinical impact, ethical and cost-benefit aspects of IT applications in health care.