Implicit bias in ICU electronic health record data: measurement frequencies and missing data rates of clinical variables.

IF 3.8 3区 医学 Q2 MEDICAL INFORMATICS
Junming Shi, Alan E Hubbard, Nicholas Fong, Romain Pirracchio
{"title":"Implicit bias in ICU electronic health record data: measurement frequencies and missing data rates of clinical variables.","authors":"Junming Shi, Alan E Hubbard, Nicholas Fong, Romain Pirracchio","doi":"10.1186/s12911-025-03058-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Systematic disparities in data collection within electronic health records (EHRs), defined as non-random patterns in the measurement and recording of clinical variables across demographic groups, can be reflective of underlying implicit bias and may affect patient outcome. Identifying and mitigating these biases is critical for ensuring equitable healthcare. This study aims to develop an analytical framework for measurement patterns, defined as the combination of measurement frequency (how often variables are collected) and missing data rates (the frequency of missing recordings), evaluate the association between them and demographic factors, and assess their impact on in-hospital mortality prediction.</p><p><strong>Methods: </strong>We conducted a retrospective cohort study using the Medical Information Mart for Intensive Care III (MIMIC-III) database, which includes data on over 40,000 ICU patients from Beth Israel Deaconess Medical Center (2001-2012). Adult patients with ICU stays longer than 24 h were included. Measurement patterns, including missing data rates and measurement frequencies, were derived from EHR data and analyzed. Targeted Machine Learning (TML) methods were used to assess potential systematic disparities in measurement patterns across demographic factors (age, gender, race/ethnicity) while controlling for confounders such as other demographics and disease severity. The predictive power of measurement patterns on in-hospital mortality was evaluated.</p><p><strong>Results: </strong>Among 23,426 patients, significant demographic systematic disparities were observed in the first 24 h of ICU stays. Elderly patients (≥ 65 years) had more frequent temperature measurements compared to younger patients, while males had slightly fewer missing temperature measurements than females. Racial disparities were notable: White patients had more frequent blood pressure and oxygen saturation (SpO2) measurements compared to Black and Hispanic patients. Measurement patterns were associated with ICU mortality, with models based solely on these patterns achieving an area under the receiver operating characteristic curve (AUC) of 0.76 (95% CI: 0.74-0.77).</p><p><strong>Conclusions: </strong>This study underscores the significance of measurement patterns in ICU EHR data, which are associated with patient demographics and ICU mortality. Analyzing patterns of missing data and measurement frequencies provides valuable insights into patient monitoring practices and potential systemic disparities in healthcare delivery. Understanding these disparities is critical for improving the fairness of healthcare delivery and developing more accurate predictive models in critical care settings.</p><p><strong>Clinical trial number: </strong>Not applicable.</p>","PeriodicalId":9340,"journal":{"name":"BMC Medical Informatics and Decision Making","volume":"25 1","pages":"241"},"PeriodicalIF":3.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12220764/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Informatics and Decision Making","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12911-025-03058-9","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Systematic disparities in data collection within electronic health records (EHRs), defined as non-random patterns in the measurement and recording of clinical variables across demographic groups, can be reflective of underlying implicit bias and may affect patient outcome. Identifying and mitigating these biases is critical for ensuring equitable healthcare. This study aims to develop an analytical framework for measurement patterns, defined as the combination of measurement frequency (how often variables are collected) and missing data rates (the frequency of missing recordings), evaluate the association between them and demographic factors, and assess their impact on in-hospital mortality prediction.

Methods: We conducted a retrospective cohort study using the Medical Information Mart for Intensive Care III (MIMIC-III) database, which includes data on over 40,000 ICU patients from Beth Israel Deaconess Medical Center (2001-2012). Adult patients with ICU stays longer than 24 h were included. Measurement patterns, including missing data rates and measurement frequencies, were derived from EHR data and analyzed. Targeted Machine Learning (TML) methods were used to assess potential systematic disparities in measurement patterns across demographic factors (age, gender, race/ethnicity) while controlling for confounders such as other demographics and disease severity. The predictive power of measurement patterns on in-hospital mortality was evaluated.

Results: Among 23,426 patients, significant demographic systematic disparities were observed in the first 24 h of ICU stays. Elderly patients (≥ 65 years) had more frequent temperature measurements compared to younger patients, while males had slightly fewer missing temperature measurements than females. Racial disparities were notable: White patients had more frequent blood pressure and oxygen saturation (SpO2) measurements compared to Black and Hispanic patients. Measurement patterns were associated with ICU mortality, with models based solely on these patterns achieving an area under the receiver operating characteristic curve (AUC) of 0.76 (95% CI: 0.74-0.77).

Conclusions: This study underscores the significance of measurement patterns in ICU EHR data, which are associated with patient demographics and ICU mortality. Analyzing patterns of missing data and measurement frequencies provides valuable insights into patient monitoring practices and potential systemic disparities in healthcare delivery. Understanding these disparities is critical for improving the fairness of healthcare delivery and developing more accurate predictive models in critical care settings.

Clinical trial number: Not applicable.

ICU电子病历数据的内隐偏倚:临床变量的测量频率和数据缺失率。
背景:电子健康记录(EHRs)中数据收集的系统性差异,被定义为跨人口统计学群体测量和记录临床变量的非随机模式,可以反映潜在的隐性偏见,并可能影响患者的结果。识别和减轻这些偏见对于确保公平的医疗保健至关重要。本研究旨在建立一个测量模式的分析框架,定义为测量频率(收集变量的频率)和缺失数据率(缺失记录的频率)的组合,评估它们与人口因素之间的关联,并评估它们对院内死亡率预测的影响。方法:我们使用重症监护医学信息市场III (MIMIC-III)数据库进行了一项回顾性队列研究,该数据库包括贝斯以色列女执事医疗中心(Beth Israel Deaconess Medical Center) 2001-2012年超过40,000名ICU患者的数据。纳入ICU住院时间超过24 h的成年患者。测量模式,包括缺失数据率和测量频率,从电子病历数据中导出并分析。目标机器学习(TML)方法用于评估不同人口因素(年龄、性别、种族/民族)测量模式的潜在系统性差异,同时控制其他人口统计学和疾病严重程度等混杂因素。评估测量模式对住院死亡率的预测能力。结果:在23,426例患者中,在ICU住院的前24小时观察到显著的人口统计学系统差异。与年轻患者相比,老年患者(≥65岁)有更频繁的体温测量,而男性的体温测量缺失略少于女性。种族差异明显:白人患者比黑人和西班牙裔患者更频繁地测量血压和血氧饱和度(SpO2)。测量模式与ICU死亡率相关,仅基于这些模式的模型的受试者工作特征曲线下面积(AUC)为0.76 (95% CI: 0.74-0.77)。结论:本研究强调了ICU电子病历数据测量模式的重要性,这些数据与患者人口统计学和ICU死亡率相关。分析缺失数据的模式和测量频率为了解患者监测实践和医疗保健服务中潜在的系统性差异提供了有价值的见解。了解这些差异对于提高医疗保健服务的公平性和在重症护理环境中开发更准确的预测模型至关重要。临床试验号:不适用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.20
自引率
5.70%
发文量
297
审稿时长
1 months
期刊介绍: BMC Medical Informatics and Decision Making is an open access journal publishing original peer-reviewed research articles in relation to the design, development, implementation, use, and evaluation of health information technologies and decision-making for human health.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信