Harnessing temporal patterns in administrative patient data to predict risk of emergency hospital admission

IF 23.8 1区 医学 Q1 MEDICAL INFORMATICS
Benjamin Post MBChB , Roman Klapaukh PhD , Prof Stephen J Brett MD , Prof A Aldo Faisal PhD
{"title":"Harnessing temporal patterns in administrative patient data to predict risk of emergency hospital admission","authors":"Benjamin Post MBChB ,&nbsp;Roman Klapaukh PhD ,&nbsp;Prof Stephen J Brett MD ,&nbsp;Prof A Aldo Faisal PhD","doi":"10.1016/S2589-7500(24)00254-1","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Unplanned hospital admissions are associated with worse patient outcomes and cause strain on health systems worldwide. Primary care electronic health records (EHRs) have successfully been used to create prediction models for emergency hospitalisation, but these approaches require a broad range of diagnostic, physiological, and laboratory values. In this study, we aimed to capture temporal patterns of patient activity from EHR data and evaluate their effectiveness in predicting emergency hospital admissions compared with conventional methods.</div></div><div><h3>Methods</h3><div>In this retrospective observational study, we used the Secure Anonymised Information Linkage databank to extract temporal patterns of primary care activity from undifferentiated electronic health record timestamp data for 1·37 million patients in Wales aged 18–80 years with at least one recorded Read code between the years 2016 and 2018. Using Gaussian mixture modelling we grouped patients into distinct temporal clusters, performed a three-stage validation of our approach and calculated the risk of emergency hospital admission for each temporal cluster group. Finally, these temporal clusters were combined with five administrative variables and incorporated into four emergency hospital admission prediction models (logistic regression, naive Bayes, XGBoost, and multilayer perceptron [MLP]) and compared with a more traditional, but data-intensive, modelling technique. The primary outcome was emergency hospital admission as the next health-care event.</div></div><div><h3>Findings</h3><div>Six distinct temporal cluster patterns of primary care EHR activity were identified, associated with varying risks of future emergency hospital admission risk. These patterns were visually interpretable, repeatable at a population-level, and clinically plausible. The best emergency hospital admission prediction model (MLP) achieved an area under the receiver operating characteristic (AUROC) of 0·82 and precision of 0·94 in regional cohorts. In external validation in regional cohorts, similar model performance was observed (AUROC 0·82 and precision 0·92). This model also matched the performance of a more complex model (extended feature model) requiring 33 clinical parameters (AUROC 0·82 <em>vs</em> 0·83; precision 0·94 <em>vs</em> 0·90) for the same task on the same dataset.</div></div><div><h3>Interpretation</h3><div>We developed a novel machine learning pipeline that extracts interpretable temporal patterns from simple representations of EHR data and can be incorporated into emergency hospital admission predictors. This framework might enable more rapid development of parsimonious clinical prediction models.</div></div><div><h3>Funding</h3><div>UKRI CDT in AI for Healthcare, UKRI Turing AI Fellowship, NIHR Imperial Biomedical Research Centre, and Research Capability Funding.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 2","pages":"Pages e124-e135"},"PeriodicalIF":23.8000,"publicationDate":"2025-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lancet Digital Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589750024002541","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Unplanned hospital admissions are associated with worse patient outcomes and cause strain on health systems worldwide. Primary care electronic health records (EHRs) have successfully been used to create prediction models for emergency hospitalisation, but these approaches require a broad range of diagnostic, physiological, and laboratory values. In this study, we aimed to capture temporal patterns of patient activity from EHR data and evaluate their effectiveness in predicting emergency hospital admissions compared with conventional methods.

Methods

In this retrospective observational study, we used the Secure Anonymised Information Linkage databank to extract temporal patterns of primary care activity from undifferentiated electronic health record timestamp data for 1·37 million patients in Wales aged 18–80 years with at least one recorded Read code between the years 2016 and 2018. Using Gaussian mixture modelling we grouped patients into distinct temporal clusters, performed a three-stage validation of our approach and calculated the risk of emergency hospital admission for each temporal cluster group. Finally, these temporal clusters were combined with five administrative variables and incorporated into four emergency hospital admission prediction models (logistic regression, naive Bayes, XGBoost, and multilayer perceptron [MLP]) and compared with a more traditional, but data-intensive, modelling technique. The primary outcome was emergency hospital admission as the next health-care event.

Findings

Six distinct temporal cluster patterns of primary care EHR activity were identified, associated with varying risks of future emergency hospital admission risk. These patterns were visually interpretable, repeatable at a population-level, and clinically plausible. The best emergency hospital admission prediction model (MLP) achieved an area under the receiver operating characteristic (AUROC) of 0·82 and precision of 0·94 in regional cohorts. In external validation in regional cohorts, similar model performance was observed (AUROC 0·82 and precision 0·92). This model also matched the performance of a more complex model (extended feature model) requiring 33 clinical parameters (AUROC 0·82 vs 0·83; precision 0·94 vs 0·90) for the same task on the same dataset.

Interpretation

We developed a novel machine learning pipeline that extracts interpretable temporal patterns from simple representations of EHR data and can be incorporated into emergency hospital admission predictors. This framework might enable more rapid development of parsimonious clinical prediction models.

Funding

UKRI CDT in AI for Healthcare, UKRI Turing AI Fellowship, NIHR Imperial Biomedical Research Centre, and Research Capability Funding.
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
41.20
自引率
1.60%
发文量
232
审稿时长
13 weeks
期刊介绍: The Lancet Digital Health publishes important, innovative, and practice-changing research on any topic connected with digital technology in clinical medicine, public health, and global health. The journal’s open access content crosses subject boundaries, building bridges between health professionals and researchers.By bringing together the most important advances in this multidisciplinary field,The Lancet Digital Health is the most prominent publishing venue in digital health. We publish a range of content types including Articles,Review, Comment, and Correspondence, contributing to promoting digital technologies in health practice worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信