Prediction of atrial fibrillation admissions in arrhythmia naïve patients from structured electronic health record data.

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Tanmay Gokhale, Nirav R Bhatt, Matthew Starr, Suresh Mulukutla, Floyd Thoma, Murat Akcakaya, Salah Al-Zaiti, Raul G Nogueira, Samir Saba
{"title":"Prediction of atrial fibrillation admissions in arrhythmia naïve patients from structured electronic health record data.","authors":"Tanmay Gokhale, Nirav R Bhatt, Matthew Starr, Suresh Mulukutla, Floyd Thoma, Murat Akcakaya, Salah Al-Zaiti, Raul G Nogueira, Samir Saba","doi":"10.1186/s12911-025-03199-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Atrial fibrillation (AF) is the most prevalent sustained arrhythmia, but its diagnosis is often elusive. In this study, we examined the role of machine learning (ML) algorithms in predicting AF in arrhythmia-naïve patients, based on structured domains of the electronic health records (EHR).</p><p><strong>Methods: </strong>Patients (N = 186,769) with no prior history of AF, who received at least 1 echocardiogram and who had a minimum of 3 months of follow-up, were included. Data from the EHR were grouped into domains (demographic; social determinants of health; past medical history, medications, electrocardiogram (EKG), and echocardiogram (Echo)) and tested incrementally for their ability to predict incident AF admission to the hospital.</p><p><strong>Results: </strong>Of the overall cohort, 4,751 (2.5%) patients were admitted for AF over a median follow-up time of 35 months. Incremental EHR domains increased the area under the receiver-operator curve (AUROC) for all ML classifiers, with Gradient Boosting achieving an AUROC of 0.85 when all domains were included, but with a poor F1 score of 14% at the maximal Youden index. Using the EKG and Echo domains alone achieved comparable performance to when all EHR domains were included. These results were externally validated.</p><p><strong>Conclusion: </strong>More domains of structured EHR improve the ability to predict incident AF admissions but structured EKG and Echo domains realize the most gain. Although ML models exhibited good discrimination, the precision is poor due to the low event rate.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"348"},"PeriodicalIF":3.8000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12482350/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03199-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Atrial fibrillation (AF) is the most prevalent sustained arrhythmia, but its diagnosis is often elusive. In this study, we examined the role of machine learning (ML) algorithms in predicting AF in arrhythmia-naïve patients, based on structured domains of the electronic health records (EHR).

Methods: Patients (N = 186,769) with no prior history of AF, who received at least 1 echocardiogram and who had a minimum of 3 months of follow-up, were included. Data from the EHR were grouped into domains (demographic; social determinants of health; past medical history, medications, electrocardiogram (EKG), and echocardiogram (Echo)) and tested incrementally for their ability to predict incident AF admission to the hospital.

Results: Of the overall cohort, 4,751 (2.5%) patients were admitted for AF over a median follow-up time of 35 months. Incremental EHR domains increased the area under the receiver-operator curve (AUROC) for all ML classifiers, with Gradient Boosting achieving an AUROC of 0.85 when all domains were included, but with a poor F1 score of 14% at the maximal Youden index. Using the EKG and Echo domains alone achieved comparable performance to when all EHR domains were included. These results were externally validated.

Conclusion: More domains of structured EHR improve the ability to predict incident AF admissions but structured EKG and Echo domains realize the most gain. Although ML models exhibited good discrimination, the precision is poor due to the low event rate.

Abstract Image

Abstract Image

Abstract Image

从结构化电子病历数据预测心律失常naïve患者心房颤动入院。
背景:心房颤动(AF)是最常见的持续性心律失常,但其诊断往往难以捉摸。在本研究中,我们基于电子健康记录(EHR)的结构化域,研究了机器学习(ML)算法在预测arrhythmia-naïve患者房颤中的作用。方法:纳入无房颤病史,接受至少1次超声心动图检查,随访至少3个月的患者(N = 186,769)。来自EHR的数据被分成不同的领域(人口统计学、健康的社会决定因素、既往病史、药物、心电图(EKG)和超声心动图(Echo)),并逐步测试其预测AF入院的能力。结果:在整个队列中,4751例(2.5%)患者在35个月的中位随访时间内因房颤入院。增量EHR域增加了所有ML分类器的接受者-操作者曲线(AUROC)下的面积,当包括所有域时,Gradient Boosting实现了0.85的AUROC,但在最大约登指数下的F1得分很差,为14%。单独使用EKG和Echo域与包括所有EHR域时的性能相当。这些结果经过外部验证。结论:更多的结构化EHR域提高了预测房颤入院的能力,但结构化EKG和Echo域的收益最大。虽然ML模型具有良好的判别能力,但由于事件率低,精度较差。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信