Electronic Health Records for Population Health Management: Comparison of Electronic Health Record-Derived Hypertension Prevalence Measures Against Established Survey Data.

IF 1.1

Online journal of public health informatics Pub Date : 2024-03-13 DOI:10.2196/48300

Katie S Allen, Nimish Valvi, P Joseph Gibson, Timothy McFarlane, Brian E Dixon

{"title":"Electronic Health Records for Population Health Management: Comparison of Electronic Health Record-Derived Hypertension Prevalence Measures Against Established Survey Data.","authors":"Katie S Allen, Nimish Valvi, P Joseph Gibson, Timothy McFarlane, Brian E Dixon","doi":"10.2196/48300","DOIUrl":null,"url":null,"abstract":"Background: Hypertension is the most prevalent risk factor for mortality globally. Uncontrolled hypertension is associated with excess morbidity and mortality, and nearly one-half of individuals with hypertension do not have the condition under control. Data from electronic health record (EHR) systems may be useful for community hypertension surveillance, filling a gap in local public health departments' community health assessments and supporting the public health data modernization initiatives currently underway. To identify patients with hypertension, computable phenotypes are required. These phenotypes leverage available data elements-such as vitals measurements and medications-to identify patients diagnosed with hypertension. However, there are multiple methodologies for creating a phenotype, and the identification of which method most accurately reflects real-world prevalence rates is needed to support data modernization initiatives.Objective: This study sought to assess the comparability of 6 different EHR-based hypertension prevalence estimates with estimates from a national survey. Each of the prevalence estimates was created using a different computable phenotype. The overarching goal is to identify which phenotypes most closely align with nationally accepted estimations.Methods: Using the 6 different EHR-based computable phenotypes, we calculated hypertension prevalence estimates for Marion County, Indiana, for the period from 2014 to 2015. We extracted hypertension rates from the Behavioral Risk Factor Surveillance System (BRFSS) for the same period. We used the two 1-sided t test (TOST) to test equivalence between BRFSS- and EHR-based prevalence estimates. The TOST was performed at the overall level as well as stratified by age, gender, and race.Results: Using both 80% and 90% CIs, the TOST analysis resulted in 2 computable phenotypes demonstrating rough equivalence to BRFSS estimates. Variation in performance was noted across phenotypes as well as demographics. TOST with 80% CIs demonstrated that the phenotypes had less variance compared to BRFSS estimates within subpopulations, particularly those related to racial categories. Overall, less variance occurred on phenotypes that included vitals measurements.Conclusions: This study demonstrates that certain EHR-derived prevalence estimates may serve as rough substitutes for population-based survey estimates. These outcomes demonstrate the importance of critically assessing which data elements to include in EHR-based computer phenotypes. Using comprehensive data sources, containing complete clinical data as well as data representative of the population, are crucial to producing robust estimates of chronic disease. As public health departments look toward data modernization activities, the EHR may serve to assist in more timely, locally representative estimates for chronic disease prevalence.","PeriodicalId":74345,"journal":{"name":"Online journal of public health informatics","volume":"16 ","pages":"e48300"},"PeriodicalIF":1.1000,"publicationDate":"2024-03-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10973965/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Online journal of public health informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2196/48300","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background: Hypertension is the most prevalent risk factor for mortality globally. Uncontrolled hypertension is associated with excess morbidity and mortality, and nearly one-half of individuals with hypertension do not have the condition under control. Data from electronic health record (EHR) systems may be useful for community hypertension surveillance, filling a gap in local public health departments' community health assessments and supporting the public health data modernization initiatives currently underway. To identify patients with hypertension, computable phenotypes are required. These phenotypes leverage available data elements-such as vitals measurements and medications-to identify patients diagnosed with hypertension. However, there are multiple methodologies for creating a phenotype, and the identification of which method most accurately reflects real-world prevalence rates is needed to support data modernization initiatives.

Objective: This study sought to assess the comparability of 6 different EHR-based hypertension prevalence estimates with estimates from a national survey. Each of the prevalence estimates was created using a different computable phenotype. The overarching goal is to identify which phenotypes most closely align with nationally accepted estimations.

Methods: Using the 6 different EHR-based computable phenotypes, we calculated hypertension prevalence estimates for Marion County, Indiana, for the period from 2014 to 2015. We extracted hypertension rates from the Behavioral Risk Factor Surveillance System (BRFSS) for the same period. We used the two 1-sided t test (TOST) to test equivalence between BRFSS- and EHR-based prevalence estimates. The TOST was performed at the overall level as well as stratified by age, gender, and race.

Results: Using both 80% and 90% CIs, the TOST analysis resulted in 2 computable phenotypes demonstrating rough equivalence to BRFSS estimates. Variation in performance was noted across phenotypes as well as demographics. TOST with 80% CIs demonstrated that the phenotypes had less variance compared to BRFSS estimates within subpopulations, particularly those related to racial categories. Overall, less variance occurred on phenotypes that included vitals measurements.

Conclusions: This study demonstrates that certain EHR-derived prevalence estimates may serve as rough substitutes for population-based survey estimates. These outcomes demonstrate the importance of critically assessing which data elements to include in EHR-based computer phenotypes. Using comprehensive data sources, containing complete clinical data as well as data representative of the population, are crucial to producing robust estimates of chronic disease. As public health departments look toward data modernization activities, the EHR may serve to assist in more timely, locally representative estimates for chronic disease prevalence.

查看原文本刊更多论文

用于人口健康管理的电子健康记录：将电子健康记录得出的高血压患病率指标与已有的调查数据进行比较。

背景：高血压是全球最普遍的死亡风险因素。未得到控制的高血压与高发病率和高死亡率有关，近二分之一的高血压患者病情未得到控制。来自电子健康记录（EHR）系统的数据可用于社区高血压监测，填补地方公共卫生部门社区健康评估的空白，并支持目前正在进行的公共卫生数据现代化计划。要识别高血压患者，需要可计算的表型。这些表型利用现有的数据元素（如生命体征测量和药物治疗）来识别被诊断为高血压的患者。然而，创建表型有多种方法，需要确定哪种方法能最准确地反映真实世界的患病率，以支持数据现代化计划：本研究旨在评估 6 种不同的基于电子病历的高血压患病率估计值与一项全国性调查的估计值之间的可比性。每种患病率估计值均采用不同的可计算表型。总体目标是确定哪些表型与全国公认的估计值最为接近：使用 6 种不同的基于电子病历的可计算表型，我们计算了印第安纳州马里恩县 2014 年至 2015 年期间的高血压患病率估计值。我们从行为风险因素监测系统（BRFSS）中提取了同期的高血压发病率。我们使用两个单侧 t 检验（TOST）来检验基于 BRFSS 和基于电子病历的患病率估计值之间的等效性。TOST 在总体水平上进行，并按年龄、性别和种族进行分层：结果：使用 80% 和 90% CIs，TOST 分析得出了两个可计算的表型，与 BRFSS 估计值大致相当。不同表型和不同人口统计学特征的表现存在差异。具有 80% CIs 的 TOST 表明，与 BRFSS 估计值相比，表型在亚人群中的差异较小，尤其是与种族类别相关的表型。总体而言，包括生命体征测量在内的表型差异较小：本研究表明，某些由电子病历得出的患病率估计值可粗略替代基于人群的调查估计值。这些结果表明，必须严格评估在基于电子病历的计算机表型中包含哪些数据元素。使用包含完整临床数据和人口代表性数据的综合数据源，对于得出可靠的慢性病估计值至关重要。随着公共卫生部门着眼于数据现代化活动，电子病历可能有助于更及时地估算出具有地方代表性的慢性病患病率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Online journal of public health informatics

自引率

0.00%

发文量

审稿时长

10 weeks