Hendrik Schäfer, Vassilis Tsakanikas, René Garbsch, Mona Kotewitsch, Marc Teschler, Dimitris Gatsios, Dimitrios I Fotiadis, Frank C Mooren, Boris Schmitz
{"title":"A machine learning approach predicts improvement of physical exercise capacity based on pulse wave analysis in coronary artery disease patients.","authors":"Hendrik Schäfer, Vassilis Tsakanikas, René Garbsch, Mona Kotewitsch, Marc Teschler, Dimitris Gatsios, Dimitrios I Fotiadis, Frank C Mooren, Boris Schmitz","doi":"10.1016/j.jshs.2026.101144","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Patients with stable coronary artery disease (CAD) have a residual risk of adverse events and all-cause mortality. Enhancing exercise capacity by exercise training (ET) during cardiac rehabilitation (CR) is a Class 1A guideline recommendation. However, there is a high number of ET non-responders (NR) in CR. We aimed to develop a machine learning (ML) prediction model for the early identification of NR using baseline cardiopulmonary exercise testing (CPET) and pulse wave analysis (PWA) data.</p><p><strong>Methods: </strong>Participants included 393 CAD patients after myocardial infarction and/or percutaneous coronary intervention and/or coronary artery bypass graft who underwent 3-4 weeks of CR; CPET was conducted at the beginning and end of CR. Responders (R) were defined as participants who demonstrated an increase in exercise capacity (peak oxygen uptake (V̇O₂<sub>peak</sub>)) greater than 1 typical error away from 0, all other patients were defined as NR. Only baseline data including diagnosis, medication, PWA, and CPET data were used for modeling, and ML models included 10 different supervised algorithms. The dataset was split into training and test sets, and 10-fold cross-validation was used. Recursive Feature Elimination was used for feature selection to reduce dimensionality and improve generalizability. Independent external validation was performed in a dataset of CAD patients (n = 120) enrolled at 2 different centers (Germany and Spain). Predictions were explained using the model-agnostic SHapley Additive exPlanation methodology.</p><p><strong>Results: </strong>After data cleaning, 353 patients (20.4% women) with age of 55.8 ± 7.1 years (mean ± SD) were included for analysis, and 225 patients (63.7%) were classified as NR (22% women; age: 56.2 ± 7.1 years). ET participation rates were similar (R: 93.6% ± 7.5%; NR: 92.6% ± 9.3%; p = 0.76). For the prediction model, the Random Forest classifier provided the best mean balanced accuracy of 77.0%. The most influential features were breathing reserve/frequency, oxygen uptake combined with pulse wave velocity, cardiac output, and augmentation time. Of note, primary diagnosis and disease severity had only limited influence on the model. External validation of the Random Forest model showed 82.8% accuracy, with high specificity and moderate sensitivity in long-term outcome prediction.</p><p><strong>Conclusion: </strong>The developed ML-based model enables an early identification of ET NR, allowing individual patient-centered ET adaptations to improve CR.</p>","PeriodicalId":48897,"journal":{"name":"Journal of Sport and Health Science","volume":" ","pages":"101144"},"PeriodicalIF":10.3000,"publicationDate":"2026-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Sport and Health Science","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jshs.2026.101144","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HOSPITALITY, LEISURE, SPORT & TOURISM","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Patients with stable coronary artery disease (CAD) have a residual risk of adverse events and all-cause mortality. Enhancing exercise capacity by exercise training (ET) during cardiac rehabilitation (CR) is a Class 1A guideline recommendation. However, there is a high number of ET non-responders (NR) in CR. We aimed to develop a machine learning (ML) prediction model for the early identification of NR using baseline cardiopulmonary exercise testing (CPET) and pulse wave analysis (PWA) data.
Methods: Participants included 393 CAD patients after myocardial infarction and/or percutaneous coronary intervention and/or coronary artery bypass graft who underwent 3-4 weeks of CR; CPET was conducted at the beginning and end of CR. Responders (R) were defined as participants who demonstrated an increase in exercise capacity (peak oxygen uptake (V̇O₂peak)) greater than 1 typical error away from 0, all other patients were defined as NR. Only baseline data including diagnosis, medication, PWA, and CPET data were used for modeling, and ML models included 10 different supervised algorithms. The dataset was split into training and test sets, and 10-fold cross-validation was used. Recursive Feature Elimination was used for feature selection to reduce dimensionality and improve generalizability. Independent external validation was performed in a dataset of CAD patients (n = 120) enrolled at 2 different centers (Germany and Spain). Predictions were explained using the model-agnostic SHapley Additive exPlanation methodology.
Results: After data cleaning, 353 patients (20.4% women) with age of 55.8 ± 7.1 years (mean ± SD) were included for analysis, and 225 patients (63.7%) were classified as NR (22% women; age: 56.2 ± 7.1 years). ET participation rates were similar (R: 93.6% ± 7.5%; NR: 92.6% ± 9.3%; p = 0.76). For the prediction model, the Random Forest classifier provided the best mean balanced accuracy of 77.0%. The most influential features were breathing reserve/frequency, oxygen uptake combined with pulse wave velocity, cardiac output, and augmentation time. Of note, primary diagnosis and disease severity had only limited influence on the model. External validation of the Random Forest model showed 82.8% accuracy, with high specificity and moderate sensitivity in long-term outcome prediction.
Conclusion: The developed ML-based model enables an early identification of ET NR, allowing individual patient-centered ET adaptations to improve CR.
期刊介绍:
The Journal of Sport and Health Science (JSHS) is an international, multidisciplinary journal that aims to advance the fields of sport, exercise, physical activity, and health sciences. Published by Elsevier B.V. on behalf of Shanghai University of Sport, JSHS is dedicated to promoting original and impactful research, as well as topical reviews, editorials, opinions, and commentary papers.
With a focus on physical and mental health, injury and disease prevention, traditional Chinese exercise, and human performance, JSHS offers a platform for scholars and researchers to share their findings and contribute to the advancement of these fields. Our journal is peer-reviewed, ensuring that all published works meet the highest academic standards.
Supported by a carefully selected international editorial board, JSHS upholds impeccable integrity and provides an efficient publication platform. We invite submissions from scholars and researchers worldwide, and we are committed to disseminating insightful and influential research in the field of sport and health science.