A machine learning tool for identifying newly diagnosed heart failure in individuals with known diabetes in primary care.

IF 3.2 2区 医学 Q2 CARDIAC & CARDIOVASCULAR SYSTEMS
Per Wändell, Axel C Carlsson, Julia Eriksson, Caroline Wachtler, Toralph Ruge
{"title":"A machine learning tool for identifying newly diagnosed heart failure in individuals with known diabetes in primary care.","authors":"Per Wändell, Axel C Carlsson, Julia Eriksson, Caroline Wachtler, Toralph Ruge","doi":"10.1002/ehf2.15115","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>We aimed to create a predictive model utilizing machine learning (ML) to identify new cases of congestive heart failure (CHF) in individuals with diabetes in primary health care (PHC) through the analysis of diagnostic data.</p><p><strong>Methods: </strong>We used a sex- and age-matched case-control design. Cases of new CHF were identified across all outpatient care settings 2015-2022 (n = 9098). We included individuals 30 years and above, by sex and age groups of 30-65 years and >65 years. The controls (five per case) were sampled from the individuals in 2015-2022 without CHF at any time between 2010 and 2022, in total 45 490. From the stochastic gradient boosting (SGB) technique model, we obtained a rank of the 10 most important factors related to newly diagnosed CHF in individuals with diabetes, with the normalized relative influence (NRI) score and a corresponding odds ratio of marginal effects (OR<sub>ME</sub>). Area under curve (AUC) was calculated.</p><p><strong>Results: </strong>For women 30-65 years and >65 years, we identified 488 and 3240 new cases of CHF, respectively, and men 30-65 years and >65 years 1196 and 4174 new cases. Among the 10 most important factors in the four groups (divided by sex and lower and higher age) for newly diagnosed CHF, we found the number of visits 12 months before diagnosis (NRI 44.3%-55.9%), coronary artery disease (NRI 2.9%-7.8%), atrial fibrillation and flutter (NRI 6.6%-12.2%) and 'abnormalities of breathing' (ICD-10 code R06) (NRI 2.6%-4.4%) were predictive in all groups. For younger women, a diagnosis of COPD (NRI 2.7%) contributed to the predictive effect, while for older women, oedema (NRI 3.1%) and number of years with diabetes (NRI 3.5%) contributed to the predictive effect. For men in both age groups, chronic renal disease had predictive effect (NRI 3.9%-5.1%) The model prediction of CHF among patients with diabetes was high, AUC around 0.85 for the four groups, and with sensitivity over 0.783 and specificity over 0.708 for all four groups.</p><p><strong>Conclusions: </strong>An SGB model using routinely collected data about diagnoses and number of visits in primary care, can accurately predict risk for diagnosis of heart failure in individuals with diabetes. Age and sex difference in predictive factors warrant further examination.</p>","PeriodicalId":11864,"journal":{"name":"ESC Heart Failure","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-10-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESC Heart Failure","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ehf2.15115","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Aims: We aimed to create a predictive model utilizing machine learning (ML) to identify new cases of congestive heart failure (CHF) in individuals with diabetes in primary health care (PHC) through the analysis of diagnostic data.

Methods: We used a sex- and age-matched case-control design. Cases of new CHF were identified across all outpatient care settings 2015-2022 (n = 9098). We included individuals 30 years and above, by sex and age groups of 30-65 years and >65 years. The controls (five per case) were sampled from the individuals in 2015-2022 without CHF at any time between 2010 and 2022, in total 45 490. From the stochastic gradient boosting (SGB) technique model, we obtained a rank of the 10 most important factors related to newly diagnosed CHF in individuals with diabetes, with the normalized relative influence (NRI) score and a corresponding odds ratio of marginal effects (ORME). Area under curve (AUC) was calculated.

Results: For women 30-65 years and >65 years, we identified 488 and 3240 new cases of CHF, respectively, and men 30-65 years and >65 years 1196 and 4174 new cases. Among the 10 most important factors in the four groups (divided by sex and lower and higher age) for newly diagnosed CHF, we found the number of visits 12 months before diagnosis (NRI 44.3%-55.9%), coronary artery disease (NRI 2.9%-7.8%), atrial fibrillation and flutter (NRI 6.6%-12.2%) and 'abnormalities of breathing' (ICD-10 code R06) (NRI 2.6%-4.4%) were predictive in all groups. For younger women, a diagnosis of COPD (NRI 2.7%) contributed to the predictive effect, while for older women, oedema (NRI 3.1%) and number of years with diabetes (NRI 3.5%) contributed to the predictive effect. For men in both age groups, chronic renal disease had predictive effect (NRI 3.9%-5.1%) The model prediction of CHF among patients with diabetes was high, AUC around 0.85 for the four groups, and with sensitivity over 0.783 and specificity over 0.708 for all four groups.

Conclusions: An SGB model using routinely collected data about diagnoses and number of visits in primary care, can accurately predict risk for diagnosis of heart failure in individuals with diabetes. Age and sex difference in predictive factors warrant further examination.

用于识别初级保健中已知糖尿病患者新诊断出的心力衰竭的机器学习工具。
目的:我们旨在利用机器学习(ML)创建一个预测模型,通过分析诊断数据来识别初级卫生保健(PHC)中糖尿病患者的充血性心力衰竭(CHF)新病例:我们采用了性别和年龄匹配的病例对照设计。方法:我们采用了性别和年龄匹配的病例对照设计,在2015-2022年的所有门诊医疗机构中发现了新的CHF病例(n = 9098)。我们按性别、30-65 岁年龄组和 65 岁以上年龄组纳入了 30 岁及以上的患者。对照组(每个病例 5 个)从 2015-2022 年间在 2010 年至 2022 年间任何时间都没有患有慢性阻塞性肺病的人中抽取,共计 45 490 人。通过随机梯度提升(SGB)技术模型,我们得出了与糖尿病患者新诊断出的慢性心肌梗死相关的 10 个最重要因素的排名,并得出了归一化相对影响(NRI)得分和相应的边际效应几率(ORME)。计算了曲线下面积(AUC):在 30-65 岁和大于 65 岁的女性中,我们分别发现了 488 例和 3240 例新的心房颤动病例;在 30-65 岁和大于 65 岁的男性中,分别发现了 1196 例和 4174 例新的心房颤动病例。在四组(按性别、低龄和高龄划分)新诊断为慢性心力衰竭的 10 个最重要因素中,我们发现诊断前 12 个月的就诊次数(NRI 44.3%-55.9%)、冠状动脉疾病(NRI 2.9%-7.8%)、心房颤动和扑动(NRI 6.6%-12.2%)和 "呼吸异常"(ICD-10 代码 R06)(NRI 2.6%-4.4%)在所有组别中都具有预测作用。对年轻女性而言,慢性阻塞性肺病诊断(NRI 2.7%)有助于提高预测效果,而对老年女性而言,水肿(NRI 3.1%)和糖尿病患病年数(NRI 3.5%)有助于提高预测效果。对两个年龄组的男性而言,慢性肾脏病都有预测作用(NRI 3.9%-5.1%)。该模型对糖尿病患者的慢性心肌梗死预测率很高,四个组别的AUC均在0.85左右,所有四个组别的灵敏度均超过0.783,特异性均超过0.708:结论:SGB 模型利用常规收集的初级保健诊断和就诊次数数据,可准确预测糖尿病患者诊断心衰的风险。预测因素的年龄和性别差异值得进一步研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ESC Heart Failure
ESC Heart Failure Medicine-Cardiology and Cardiovascular Medicine
CiteScore
7.00
自引率
7.90%
发文量
461
审稿时长
12 weeks
期刊介绍: ESC Heart Failure is the open access journal of the Heart Failure Association of the European Society of Cardiology dedicated to the advancement of knowledge in the field of heart failure. The journal aims to improve the understanding, prevention, investigation and treatment of heart failure. Molecular and cellular biology, pathology, physiology, electrophysiology, pharmacology, as well as the clinical, social and population sciences all form part of the discipline that is heart failure. Accordingly, submission of manuscripts on basic, translational, clinical and population sciences is invited. Original contributions on nursing, care of the elderly, primary care, health economics and other specialist fields related to heart failure are also welcome, as are case reports that highlight interesting aspects of heart failure care and treatment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信