Preparing for future pandemics: Automated intensive care electronic health record data extraction to accelerate clinical insights

Lada Lijović , Harm Jan de Grooth , Patrick Thoral , Lieuwe Bos , Zheng Feng , Tomislav Radočaj , Paul Elbers
{"title":"Preparing for future pandemics: Automated intensive care electronic health record data extraction to accelerate clinical insights","authors":"Lada Lijović ,&nbsp;Harm Jan de Grooth ,&nbsp;Patrick Thoral ,&nbsp;Lieuwe Bos ,&nbsp;Zheng Feng ,&nbsp;Tomislav Radočaj ,&nbsp;Paul Elbers","doi":"10.1016/j.jointm.2024.10.003","DOIUrl":null,"url":null,"abstract":"<div><h3>Background</h3><div>Manual data abstraction from electronic health records (EHRs) for research on intensive care patients is time-intensive and challenging, especially during high-pressure periods such as pandemics. Automated data extraction is a potential alternative but may raise quality concerns. This study assessed the feasibility and credibility of automated data extraction during the coronavirus disease 2019 (COVID-19) pandemic.</div></div><div><h3>Methods</h3><div>We retrieved routinely collected data from the COVID-Predict Dutch Data Warehouse, a multicenter database containing the following data on intensive care patients with COVID-19: demographic, medication, laboratory results, and data from monitoring and life support devices. These data were sourced from EHRs using automated data extraction. We used these data to determine indices of wasted ventilation and their prognostic value and compared our findings to a previously published original study that relied on manual data abstraction largely from the same hospitals.</div></div><div><h3>Results</h3><div>Using automatically extracted data, we replicated the original study. Among 1515 patients intubated for over 2 days, Harris–Benedict (HB) estimates of dead space fraction increased over time and were higher in non-survivors at each time point: at the start of ventilation (0.70±0.13 <em>vs</em>. 0.67±0.15, <em>P</em> &lt;0.001), day 1 (0.74±0.10 <em>vs</em>. 0.71±0.11, <em>P</em>&lt;0.001), day 2 (0.77±0.09 <em>vs</em>. 0.73±0.11, <em>P</em>&lt;0.001), and day 3 (0.78±0.09 <em>vs</em>. 0.74±0.10, <em>P</em>&lt;0.001). Patients with HB dead space fraction above the median had an increased mortality rate of 13.5%, compared to 10.1% in those with values below the median (<em>P</em>&lt;0.005). Ventilatory ratio showed similar trends, with mortality increasing from 10.8% to 12.9% (<em>P</em>=0.040). Conversely, the end-tidal-to-arterial partial pressure of carbon dioxide (PaCO₂) ratio was inversely related to mortality, with a lower 28-day mortality in the higher than median group (8.5% <em>vs</em>. 15.1%, <em>P</em>&lt;0.001). After adjusting for base risk, impaired ventilation markers showed no significant association with 28-day mortality.</div></div><div><h3>Conclusion</h3><div>Manual data abstraction from EHRs may be unnecessary for reliable research on intensive care patients, highlighting the feasibility and credibility of automated data extraction as a trustworthy and scalable solution to accelerate clinical insights, especially during future pandemics.</div></div>","PeriodicalId":73799,"journal":{"name":"Journal of intensive medicine","volume":"5 2","pages":"Pages 167-175"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of intensive medicine","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667100X24001051","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Background

Manual data abstraction from electronic health records (EHRs) for research on intensive care patients is time-intensive and challenging, especially during high-pressure periods such as pandemics. Automated data extraction is a potential alternative but may raise quality concerns. This study assessed the feasibility and credibility of automated data extraction during the coronavirus disease 2019 (COVID-19) pandemic.

Methods

We retrieved routinely collected data from the COVID-Predict Dutch Data Warehouse, a multicenter database containing the following data on intensive care patients with COVID-19: demographic, medication, laboratory results, and data from monitoring and life support devices. These data were sourced from EHRs using automated data extraction. We used these data to determine indices of wasted ventilation and their prognostic value and compared our findings to a previously published original study that relied on manual data abstraction largely from the same hospitals.

Results

Using automatically extracted data, we replicated the original study. Among 1515 patients intubated for over 2 days, Harris–Benedict (HB) estimates of dead space fraction increased over time and were higher in non-survivors at each time point: at the start of ventilation (0.70±0.13 vs. 0.67±0.15, P <0.001), day 1 (0.74±0.10 vs. 0.71±0.11, P<0.001), day 2 (0.77±0.09 vs. 0.73±0.11, P<0.001), and day 3 (0.78±0.09 vs. 0.74±0.10, P<0.001). Patients with HB dead space fraction above the median had an increased mortality rate of 13.5%, compared to 10.1% in those with values below the median (P<0.005). Ventilatory ratio showed similar trends, with mortality increasing from 10.8% to 12.9% (P=0.040). Conversely, the end-tidal-to-arterial partial pressure of carbon dioxide (PaCO₂) ratio was inversely related to mortality, with a lower 28-day mortality in the higher than median group (8.5% vs. 15.1%, P<0.001). After adjusting for base risk, impaired ventilation markers showed no significant association with 28-day mortality.

Conclusion

Manual data abstraction from EHRs may be unnecessary for reliable research on intensive care patients, highlighting the feasibility and credibility of automated data extraction as a trustworthy and scalable solution to accelerate clinical insights, especially during future pandemics.
为未来的流行病做准备:自动化重症监护电子健康记录数据提取,以加速临床洞察
从电子健康记录(EHRs)中手动提取数据用于重症监护患者的研究是一项耗时且具有挑战性的工作,特别是在流行病等高压时期。自动数据提取是一个潜在的替代方案,但可能会引起质量问题。本研究评估了2019冠状病毒病(COVID-19)大流行期间自动数据提取的可行性和可信度。方法我们从COVID-Predict荷兰数据仓库(一个多中心数据库)中检索常规收集的数据,该数据库包含以下COVID-19重症监护患者的数据:人口统计学、药物、实验室结果以及监测和生命支持设备的数据。这些数据来自使用自动数据提取的电子病历。我们使用这些数据来确定浪费通气的指标及其预后价值,并将我们的发现与先前发表的一项原始研究进行比较,该研究主要依赖于来自同一家医院的人工数据提取。结果使用自动提取的数据,我们重复了原始研究。在1515例插管超过2天的患者中,哈里斯-本尼迪克特(HB)估计的死亡空间分数随着时间的推移而增加,在每个时间点,非幸存者的死亡空间分数更高:通气开始时(0.70±0.13比0.67±0.15,P<0.001),第1天(0.74±0.10比0.71±0.11,P<0.001),第2天(0.77±0.09比0.73±0.11,P<0.001),第3天(0.78±0.09比0.74±0.10,P<0.001)。HB死亡空间分数高于中位数的患者死亡率增加13.5%,而低于中位数的患者死亡率增加10.1% (P<0.005)。通气量变化趋势相似,死亡率由10.8%上升至12.9% (P=0.040)。相反,尾潮-动脉二氧化碳分压(PaCO₂)比与死亡率呈负相关,高中位数组28天死亡率较低(8.5%比15.1%,P<0.001)。在调整基础风险后,通气指标受损与28天死亡率无显著关联。结论从电子病历中手动提取数据对于重症监护患者的可靠研究可能是不必要的,这突出了自动化数据提取作为一种值得信赖和可扩展的解决方案的可行性和可信度,以加速临床洞察,特别是在未来的大流行期间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of intensive medicine
Journal of intensive medicine Critical Care and Intensive Care Medicine
CiteScore
1.90
自引率
0.00%
发文量
0
审稿时长
58 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信