Utilising routinely collected clinical data through time series deep learning to improve identification of bacterial bloodstream infections: a retrospective cohort study

IF 23.8 1区 医学 Q1 MEDICAL INFORMATICS
Damien K Ming PhD , Vasin Vasikasin PhD , Timothy M Rawson PhD , Prof Pantelis Georgiou PhD , Frances J Davies PhD , Prof Alison H Holmes FMedSci , Bernard Hernandez PhD
{"title":"Utilising routinely collected clinical data through time series deep learning to improve identification of bacterial bloodstream infections: a retrospective cohort study","authors":"Damien K Ming PhD ,&nbsp;Vasin Vasikasin PhD ,&nbsp;Timothy M Rawson PhD ,&nbsp;Prof Pantelis Georgiou PhD ,&nbsp;Frances J Davies PhD ,&nbsp;Prof Alison H Holmes FMedSci ,&nbsp;Bernard Hernandez PhD","doi":"10.1016/j.landig.2025.01.010","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Blood cultures are the gold standard for diagnosing bacterial bloodstream infections, but test results are only available 24–48 h after sampling. We aimed to develop and evaluate models using health-care data to predict bloodstream infections in patients admitted to hospital.</div></div><div><h3>Methods</h3><div>In this retrospective cohort study, we used routinely collected blood biomarkers and demographic data from patients who underwent blood sample collection for testing via culture between March 3, 2014, and Dec 1, 2021, at Imperial College Healthcare NHS Trust (London, UK) as model features. Data up to 14 days before blood sample collection were provided to long short-term memory (LSTM) or static logistic regression models. The primary outcome was prediction of blood culture results, defined as a pathogenic bloodstream infection (ie, isolation of pathogenic bacteria of interest) or no bloodstream infection (ie, no growth or contamination). Data collected up to Feb 28, 2021 (n=15 212) comprised the training set and were evaluated against a temporal hold-out test set comprising patients who were sampled after March 1, 2021 (n=5638).</div></div><div><h3>Findings</h3><div>Among 20 850 patients with available data, pathogenic bacteria were observed in the cultured blood samples of 3866 (18·5%) patients. 2920 (62·2%) of 4897 patients who had their blood samples taken more than 48 h after admission to hospital had pathogenic bloodstream infections, and so were defined as having hospital-acquired bloodstream infections. Including data from the 7 days before admission (7-day window approach) and using five-fold cross validation in the training set gave an area under receiver operator curve (AUROC) of 0·75 (IQR 0·68–0·82) and an area under the precision recall curve (AUPRC) of 0·58 (0·46–0·77) for static models and an AUROC of 0·92 (0·91–0·93) and AUPRC of 0·75 (0·72–0·76) for the LSTM model. In the hold-out test set performances were: AUROC of 0·74 (95% CI 0·70–0·78) and AUPRC of 0·48 (0·43–0·53) for static models and AUROC of 0·97 (0·96–0·97) and AUPRC of 0·65 (0·60–0·70) for LSTM. Removal of time series information resulted in lower model performance, particularly for hospital-acquired bloodstream infections. Dynamics of C-reactive protein concentration, eosinophil count, and platelet count were important features for prediction of blood culture results.</div></div><div><h3>Interpretation</h3><div>Deep learning models accounting for longitudinal changes could support individualised clinical decision making for patients at risk of bloodstream infections. Appropriate implementation into existing diagnostic pathways could enhance diagnostic stewardship and reduce unnecessary antimicrobial prescribing.</div></div><div><h3>Funding</h3><div>UK Department of Health and Social Care, the National Institute for Health and Care Research, and the Wellcome Trust.</div></div>","PeriodicalId":48534,"journal":{"name":"Lancet Digital Health","volume":"7 3","pages":"Pages e205-e215"},"PeriodicalIF":23.8000,"publicationDate":"2025-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Lancet Digital Health","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S258975002500010X","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Blood cultures are the gold standard for diagnosing bacterial bloodstream infections, but test results are only available 24–48 h after sampling. We aimed to develop and evaluate models using health-care data to predict bloodstream infections in patients admitted to hospital.

Methods

In this retrospective cohort study, we used routinely collected blood biomarkers and demographic data from patients who underwent blood sample collection for testing via culture between March 3, 2014, and Dec 1, 2021, at Imperial College Healthcare NHS Trust (London, UK) as model features. Data up to 14 days before blood sample collection were provided to long short-term memory (LSTM) or static logistic regression models. The primary outcome was prediction of blood culture results, defined as a pathogenic bloodstream infection (ie, isolation of pathogenic bacteria of interest) or no bloodstream infection (ie, no growth or contamination). Data collected up to Feb 28, 2021 (n=15 212) comprised the training set and were evaluated against a temporal hold-out test set comprising patients who were sampled after March 1, 2021 (n=5638).

Findings

Among 20 850 patients with available data, pathogenic bacteria were observed in the cultured blood samples of 3866 (18·5%) patients. 2920 (62·2%) of 4897 patients who had their blood samples taken more than 48 h after admission to hospital had pathogenic bloodstream infections, and so were defined as having hospital-acquired bloodstream infections. Including data from the 7 days before admission (7-day window approach) and using five-fold cross validation in the training set gave an area under receiver operator curve (AUROC) of 0·75 (IQR 0·68–0·82) and an area under the precision recall curve (AUPRC) of 0·58 (0·46–0·77) for static models and an AUROC of 0·92 (0·91–0·93) and AUPRC of 0·75 (0·72–0·76) for the LSTM model. In the hold-out test set performances were: AUROC of 0·74 (95% CI 0·70–0·78) and AUPRC of 0·48 (0·43–0·53) for static models and AUROC of 0·97 (0·96–0·97) and AUPRC of 0·65 (0·60–0·70) for LSTM. Removal of time series information resulted in lower model performance, particularly for hospital-acquired bloodstream infections. Dynamics of C-reactive protein concentration, eosinophil count, and platelet count were important features for prediction of blood culture results.

Interpretation

Deep learning models accounting for longitudinal changes could support individualised clinical decision making for patients at risk of bloodstream infections. Appropriate implementation into existing diagnostic pathways could enhance diagnostic stewardship and reduce unnecessary antimicrobial prescribing.

Funding

UK Department of Health and Social Care, the National Institute for Health and Care Research, and the Wellcome Trust.
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
41.20
自引率
1.60%
发文量
232
审稿时长
13 weeks
期刊介绍: The Lancet Digital Health publishes important, innovative, and practice-changing research on any topic connected with digital technology in clinical medicine, public health, and global health. The journal’s open access content crosses subject boundaries, building bridges between health professionals and researchers.By bringing together the most important advances in this multidisciplinary field,The Lancet Digital Health is the most prominent publishing venue in digital health. We publish a range of content types including Articles,Review, Comment, and Correspondence, contributing to promoting digital technologies in health practice worldwide.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信