Predicting cardiovascular disease in patients with mental illness using machine learning
Martin Bernstorff, Lasse Hansen, Kevin Kris Warnakula Olesen, Andreas Aalkjær Danielsen, Søren Dinesen Østergaard
European Psychiatry, e12 (Journal Article). Published 2025-01-08. DOI: 10.1192/j.eurpsy.2025.1 (https://doi.org/10.1192/j.eurpsy.2025.1)
Impact factor: 7.2 (JCR Q1, Psychiatry)
Citations: 0
Abstract
Background: Cardiovascular disease (CVD) is twice as prevalent among individuals with mental illness as in the general population. Prevention strategies exist but require accurate risk prediction. This study aimed to develop and validate a machine learning model for predicting incident CVD among patients with mental illness using routine clinical data from electronic health records.
Methods: A cohort study was conducted using data from 74,880 patients with 1.6 million psychiatric service contacts in the Central Denmark Region from 2013 to 2021. Two machine learning models (XGBoost and regularised logistic regression) were trained on 85% of the data from six hospitals using 234 potential predictors. The best-performing model was externally validated on the remaining 15% of patients from another three hospitals. CVD was defined as myocardial infarction, stroke, or peripheral arterial disease.
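The hospital-level split described in the Methods can be illustrated with a minimal sketch. It assumes each record carries a hospital identifier and that the validation hospitals are chosen up front. The field names (`patient_id`, `hospital`), the site labels `H1` to `H9`, and the synthetic records are hypothetical and are not taken from the study, which does not state how hospitals were assigned to each set.

```python
import random


def split_by_hospital(records, validation_hospitals):
    """Partition records so that no hospital contributes to both sets."""
    train, validation = [], []
    for rec in records:
        if rec["hospital"] in validation_hospitals:
            validation.append(rec)
        else:
            train.append(rec)
    return train, validation


# Synthetic toy data: nine hypothetical hospitals, labelled H1..H9.
rng = random.Random(0)
records = [
    {"patient_id": i, "hospital": f"H{rng.randint(1, 9)}"}
    for i in range(1000)
]

# Assumption: three sites are held out for external validation.
validation_hospitals = {"H7", "H8", "H9"}
train, validation = split_by_hospital(records, validation_hospitals)

train_sites = {r["hospital"] for r in train}
val_sites = {r["hospital"] for r in validation}
```

Splitting by site rather than at random keeps every validation patient at a hospital the model never saw during training, which is what makes the validation external.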
Results: The best-performing model (hyperparameter-tuned XGBoost) demonstrated acceptable discrimination, with an area under the receiver operating characteristic curve of 0.84 on the training set and 0.74 on the validation set. It identified high-risk individuals 2.5 years before CVD events. For the psychiatric service contacts in the top 5% of predicted risk, the positive predictive value was 5%, and the negative predictive value was 99%. The model issued at least one positive prediction for 39% of patients who developed CVD.
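The metrics reported in the Results can be computed as in the following sketch, which is not the authors' code. It assumes one binary label and one predicted risk score per psychiatric service contact, and it simplifies the patient-level measure by counting a patient as a case if any of their contacts has a positive label. The function names and the toy data are hypothetical.

```python
def auroc(labels, scores):
    """AUROC as the chance a random positive outranks a random negative.

    Ties count as half a win. Quadratic in the number of contacts,
    so suitable only for small illustrations.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def top_fraction_metrics(labels, scores, fraction=0.05):
    """Flag the top `fraction` of contacts by score and return PPV and NPV."""
    n_flag = max(1, round(len(scores) * fraction))
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    flagged = set(order[:n_flag])
    tp = sum(labels[i] for i in flagged)
    fp = n_flag - tp
    fn = sum(labels[i] for i in range(len(labels)) if i not in flagged)
    tn = len(labels) - n_flag - fn
    return {"ppv": tp / (tp + fp), "npv": tn / (tn + fn), "flagged": flagged}


def patient_level_sensitivity(patient_ids, labels, flagged):
    """Share of case patients with at least one flagged contact."""
    cases = {p for p, y in zip(patient_ids, labels) if y == 1}
    caught = {patient_ids[i] for i in flagged if patient_ids[i] in cases}
    return len(caught) / len(cases)


# Toy data: 20 contacts from four hypothetical patients, two positive contacts.
patient_ids = ["A"] * 5 + ["B"] * 5 + ["C"] * 5 + ["D"] * 5
labels = [1 if i in (0, 5) else 0 for i in range(20)]
scores = [1 - i / 20 for i in range(20)]

auc = auroc(labels, scores)
metrics = top_fraction_metrics(labels, scores, fraction=0.05)
sensitivity = patient_level_sensitivity(patient_ids, labels, metrics["flagged"])
```

PPV and NPV are computed per contact, whereas the last measure is computed per patient. This is why a model with a low contact-level PPV can still issue at least one positive prediction for a sizeable share of patients who go on to develop CVD.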
Conclusions: A machine learning model can accurately predict CVD risk among patients with mental illness using routinely collected electronic health record data. A decision support system building on this approach may aid primary CVD prevention in this high-risk population.
About the journal:
European Psychiatry, the official journal of the European Psychiatric Association, is dedicated to sharing cutting-edge research and policy updates and to fostering dialogue among clinicians, researchers, and patient advocates in the fields of psychiatry, mental health, behavioral science, and neuroscience. This peer-reviewed, Open Access journal strives to publish the latest advances across a range of mental health issues, including diagnostic and treatment breakthroughs and progress in understanding the biological foundations of mental, behavioral, and cognitive functions in both clinical and general population studies.