Sex-specific cardiovascular disease risk prediction using statistical learning and explainable artificial intelligence: the HUNT Study.

IF 8.4 2区 医学 Q1 CARDIAC & CARDIOVASCULAR SYSTEMS
Virginia De Martin Topranin, Atle Wiig-Fisketjøn, Emma Botten, Håvard Dalen, Mette Langaas, Anja Bye
{"title":"Sex-specific cardiovascular disease risk prediction using statistical learning and explainable artificial intelligence: the HUNT Study.","authors":"Virginia De Martin Topranin, Atle Wiig-Fisketjøn, Emma Botten, Håvard Dalen, Mette Langaas, Anja Bye","doi":"10.1093/eurjpc/zwaf135","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Current risk prediction models, such as the Norwegian NORRISK 2, explain only a modest proportion of cardiovascular disease (CVD) incidence. This study aimed to develop improved sex-specific models for predicting the 10-year CVD risk as well as sex- and age-specific thresholds for intervention.</p><p><strong>Methods: </strong>Data from 31,946 participants (40-79 years) without prior CVD were analyzed. Data were randomly split into a training set (for estimation) and a test set (for model evaluation). An extreme gradient boosting (XGBoost) model was used to identify the most important predictive variables. Next, prediction models were developed on the training set for each sex separately using XGBoost and logistic regression. The models were evaluated on the test set using receiver-operating characteristic (ROC) and precision recall (PR) curves. Finally, age- and sex-specific thresholds for intervention were explored.</p><p><strong>Results: </strong>All traditional risk factors included in NORRISK 2 and the European SCORE2 model were important predictors for males, but not for females. Potential new risk predictors were identified. The XGBoost model improved CVD risk prediction for males: 0.013- and 0.012-unit increase in ROC-AUC compared to NORRISK 2 and SCORE2 respectively, and 12% and 11% increase in PR-AUC respectively. For females, neither the XGBoost nor logistic regression model performed significantly better than NORRISK 2 and SCORE2. Age- and sex-specific thresholds showed an improvement in sensitivity compared with NORRISK 2-suggested thresholds.</p><p><strong>Conclusions: </strong>By employing statistical learning and incorporating sex-specific risk factors, we propose improved risk prediction models for CVD in males. Introducing sex-specific thresholds for intervention could enhance CVD prevention for both sexes.</p>","PeriodicalId":12051,"journal":{"name":"European journal of preventive cardiology","volume":" ","pages":""},"PeriodicalIF":8.4000,"publicationDate":"2025-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European journal of preventive cardiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/eurjpc/zwaf135","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: Current risk prediction models, such as the Norwegian NORRISK 2, explain only a modest proportion of cardiovascular disease (CVD) incidence. This study aimed to develop improved sex-specific models for predicting the 10-year CVD risk as well as sex- and age-specific thresholds for intervention.

Methods: Data from 31,946 participants (40-79 years) without prior CVD were analyzed. Data were randomly split into a training set (for estimation) and a test set (for model evaluation). An extreme gradient boosting (XGBoost) model was used to identify the most important predictive variables. Next, prediction models were developed on the training set for each sex separately using XGBoost and logistic regression. The models were evaluated on the test set using receiver-operating characteristic (ROC) and precision recall (PR) curves. Finally, age- and sex-specific thresholds for intervention were explored.

Results: All traditional risk factors included in NORRISK 2 and the European SCORE2 model were important predictors for males, but not for females. Potential new risk predictors were identified. The XGBoost model improved CVD risk prediction for males: 0.013- and 0.012-unit increase in ROC-AUC compared to NORRISK 2 and SCORE2 respectively, and 12% and 11% increase in PR-AUC respectively. For females, neither the XGBoost nor logistic regression model performed significantly better than NORRISK 2 and SCORE2. Age- and sex-specific thresholds showed an improvement in sensitivity compared with NORRISK 2-suggested thresholds.

Conclusions: By employing statistical learning and incorporating sex-specific risk factors, we propose improved risk prediction models for CVD in males. Introducing sex-specific thresholds for intervention could enhance CVD prevention for both sexes.

利用统计学习和可解释人工智能预测心血管疾病的性别特异性风险:HUNT 研究。
目的:目前的风险预测模型(如挪威的 NORRISK 2)只能解释心血管疾病(CVD)发病率的一小部分。这项研究旨在开发改进的性别特异性模型,用于预测 10 年心血管疾病风险以及性别和年龄特异性干预阈值:分析了 31,946 名无心血管疾病史的参与者(40-79 岁)的数据。数据被随机分成训练集(用于估算)和测试集(用于模型评估)。使用极端梯度提升(XGBoost)模型来确定最重要的预测变量。然后,在训练集上使用 XGBoost 和逻辑回归分别为每种性别建立预测模型。在测试集上使用接收器操作特征曲线(ROC)和精确召回曲线(PR)对模型进行评估。最后,还探讨了针对不同年龄和性别的干预阈值:结果:NORRISK 2 和欧洲 SCORE2 模型中包含的所有传统风险因素对男性来说都是重要的预测因素,但对女性来说不是。研究还发现了潜在的新风险预测因素。XGBoost 模型改善了对男性心血管疾病风险的预测:与 NORRISK 2 和 SCORE2 相比,ROC-AUC 分别提高了 0.013 个单位和 0.012 个单位,PR-AUC 分别提高了 12% 和 11%。就女性而言,XGBoost 和逻辑回归模型的表现均未明显优于 NORRISK 2 和 SCORE2。与 NORRISK 2 建议的阈值相比,年龄和性别阈值的灵敏度有所提高:通过采用统计学习并结合性别特异性风险因素,我们提出了更好的男性心血管疾病风险预测模型。采用性别特异性阈值进行干预,可以加强对两性心血管疾病的预防。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European journal of preventive cardiology
European journal of preventive cardiology CARDIAC & CARDIOVASCULAR SYSTEMS-
CiteScore
12.50
自引率
12.00%
发文量
601
审稿时长
3-8 weeks
期刊介绍: European Journal of Preventive Cardiology (EJPC) is an official journal of the European Society of Cardiology (ESC) and the European Association of Preventive Cardiology (EAPC). The journal covers a wide range of scientific, clinical, and public health disciplines related to cardiovascular disease prevention, risk factor management, cardiovascular rehabilitation, population science and public health, and exercise physiology. The categories covered by the journal include classical risk factors and treatment, lifestyle risk factors, non-modifiable cardiovascular risk factors, cardiovascular conditions, concomitant pathological conditions, sport cardiology, diagnostic tests, care settings, epidemiology, pharmacology and pharmacotherapy, machine learning, and artificial intelligence.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信