在被称为国家临床队列协作的全国抽样电子健康记录库中识别艾滋病毒感染者或有风险的人:计算表型研究。

IF 3.1 3区 医学 Q2 MEDICAL INFORMATICS
Eric Hurwitz, Cara D Varley, A Jerrod Anzolone, Vithal Madhira, Amy L Olex, Jing Sun, Dimple Vaidya, Nada Fadul, Jessica Y Islam, Lesley E Jackson, Kenneth J Wilkins, Zachary Butzin-Dozier, Dongmei Li, Sandra E Safo, Julie A McMurry, Pooja Maheria, Tommy Williams, Shukri A Hassan, Melissa A Haendel, Rena C Patel
{"title":"在被称为国家临床队列协作的全国抽样电子健康记录库中识别艾滋病毒感染者或有风险的人:计算表型研究。","authors":"Eric Hurwitz, Cara D Varley, A Jerrod Anzolone, Vithal Madhira, Amy L Olex, Jing Sun, Dimple Vaidya, Nada Fadul, Jessica Y Islam, Lesley E Jackson, Kenneth J Wilkins, Zachary Butzin-Dozier, Dongmei Li, Sandra E Safo, Julie A McMurry, Pooja Maheria, Tommy Williams, Shukri A Hassan, Melissa A Haendel, Rena C Patel","doi":"10.2196/68143","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Electronic health records (EHRs) provide valuable insights to address clinical and epidemiological research concerning HIV, including the disproportionate impact of the COVID-19 pandemic on people living with HIV. To identify this population, most studies using EHR or claims databases start with diagnostic codes, which can result in misclassification without further refinement using drug or laboratory data. Furthermore, given that antiretrovirals now have indications for both HIV and COVID-19 (ie, ritonavir in nirmatrelvir/ritonavir), new phenotyping methods are needed to better capture people living with HIV. Therefore, we created a generalizable and innovative method to robustly identify people living with HIV, preexposure prophylaxis (PrEP) users, postexposure prophylaxis (PEP) users, and people not living with HIV using granular clinical data after the emergence of COVID-19.</p><p><strong>Objective: </strong>The primary aim of this study was to use computational phenotyping in EHR data to identify people living with HIV (cohort 1), PrEP users (cohort 2), PEP users (cohort 3), or \"none of the above\" (people not living with HIV; cohort 4) and describe COVID-19-related characteristics among these cohorts.</p><p><strong>Methods: </strong>We used diagnostic and laboratory measurements and drug concepts in the National Clinical Cohort Collaborative to create a computational phenotype for the 4 cohorts with confidence levels. For robustness, we conducted a randomly sampled, blinded clinician annotation to assess precision. We calculated the distribution of demographics, comorbidities, and COVID-19 variables among the 4 cohorts.</p><p><strong>Results: </strong>We identified 132,664 people living with HIV with a high level of confidence, 36,088 PrEP users, 4120 PEP users, and 20,639,675 people not living with HIV. Most people living with HIV were identified by a combination of medical conditions, laboratory measurements, and drug exposures (74,809/132,664, 56.4%), followed by laboratory measurements and drug exposures (15,241/132,664, 11.5%) and then by medical conditions and drug exposures (14,595/132,664, 11%). A higher proportion of people living with HIV experienced COVID-19-related hospitalization (4650,132,664, 3.5%) or mortality (828/132,664, 0.6%) and all-cause mortality (2083/132,664, 1.6%) compared to other cohorts.</p><p><strong>Conclusions: </strong>Using an extensive phenotyping algorithm leveraging granular data in an EHR repository, we have identified people living with HIV, people not living with HIV, PrEP users, and PEP users. Our findings offer transferable lessons to optimize future EHR phenotyping for these cohorts.</p>","PeriodicalId":56334,"journal":{"name":"JMIR Medical Informatics","volume":"13 ","pages":"e68143"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Identifying People Living With or Those at Risk for HIV in a Nationally Sampled Electronic Health Record Repository Called the National Clinical Cohort Collaborative: Computational Phenotyping Study.\",\"authors\":\"Eric Hurwitz, Cara D Varley, A Jerrod Anzolone, Vithal Madhira, Amy L Olex, Jing Sun, Dimple Vaidya, Nada Fadul, Jessica Y Islam, Lesley E Jackson, Kenneth J Wilkins, Zachary Butzin-Dozier, Dongmei Li, Sandra E Safo, Julie A McMurry, Pooja Maheria, Tommy Williams, Shukri A Hassan, Melissa A Haendel, Rena C Patel\",\"doi\":\"10.2196/68143\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Electronic health records (EHRs) provide valuable insights to address clinical and epidemiological research concerning HIV, including the disproportionate impact of the COVID-19 pandemic on people living with HIV. To identify this population, most studies using EHR or claims databases start with diagnostic codes, which can result in misclassification without further refinement using drug or laboratory data. Furthermore, given that antiretrovirals now have indications for both HIV and COVID-19 (ie, ritonavir in nirmatrelvir/ritonavir), new phenotyping methods are needed to better capture people living with HIV. Therefore, we created a generalizable and innovative method to robustly identify people living with HIV, preexposure prophylaxis (PrEP) users, postexposure prophylaxis (PEP) users, and people not living with HIV using granular clinical data after the emergence of COVID-19.</p><p><strong>Objective: </strong>The primary aim of this study was to use computational phenotyping in EHR data to identify people living with HIV (cohort 1), PrEP users (cohort 2), PEP users (cohort 3), or \\\"none of the above\\\" (people not living with HIV; cohort 4) and describe COVID-19-related characteristics among these cohorts.</p><p><strong>Methods: </strong>We used diagnostic and laboratory measurements and drug concepts in the National Clinical Cohort Collaborative to create a computational phenotype for the 4 cohorts with confidence levels. For robustness, we conducted a randomly sampled, blinded clinician annotation to assess precision. We calculated the distribution of demographics, comorbidities, and COVID-19 variables among the 4 cohorts.</p><p><strong>Results: </strong>We identified 132,664 people living with HIV with a high level of confidence, 36,088 PrEP users, 4120 PEP users, and 20,639,675 people not living with HIV. Most people living with HIV were identified by a combination of medical conditions, laboratory measurements, and drug exposures (74,809/132,664, 56.4%), followed by laboratory measurements and drug exposures (15,241/132,664, 11.5%) and then by medical conditions and drug exposures (14,595/132,664, 11%). A higher proportion of people living with HIV experienced COVID-19-related hospitalization (4650,132,664, 3.5%) or mortality (828/132,664, 0.6%) and all-cause mortality (2083/132,664, 1.6%) compared to other cohorts.</p><p><strong>Conclusions: </strong>Using an extensive phenotyping algorithm leveraging granular data in an EHR repository, we have identified people living with HIV, people not living with HIV, PrEP users, and PEP users. Our findings offer transferable lessons to optimize future EHR phenotyping for these cohorts.</p>\",\"PeriodicalId\":56334,\"journal\":{\"name\":\"JMIR Medical Informatics\",\"volume\":\"13 \",\"pages\":\"e68143\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JMIR Medical Informatics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.2196/68143\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"MEDICAL INFORMATICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JMIR Medical Informatics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.2196/68143","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MEDICAL INFORMATICS","Score":null,"Total":0}
引用次数: 0

摘要

背景:电子健康记录(EHRs)为解决有关艾滋病毒的临床和流行病学研究提供了宝贵的见解,包括COVID-19大流行对艾滋病毒感染者的不成比例的影响。为了确定这一人群,大多数使用电子病历或索赔数据库的研究从诊断代码开始,这可能导致错误分类,而没有使用药物或实验室数据进一步改进。此外,鉴于抗逆转录病毒药物现在既有针对艾滋病毒的适应症,也有针对COVID-19的适应症(即尼马特韦/利托那韦中的利托那韦),需要新的分型方法来更好地捕捉艾滋病毒感染者。因此,我们创建了一种通用的创新方法,可以在COVID-19出现后使用颗粒临床数据来稳健地识别艾滋病毒感染者,暴露前预防(PrEP)使用者,暴露后预防(PEP)使用者和非艾滋病毒感染者。目的:本研究的主要目的是在EHR数据中使用计算表型来识别艾滋病毒感染者(队列1)、PrEP使用者(队列2)、PEP使用者(队列3)或“以上皆非”(未感染艾滋病毒的人;队列4),并描述这些队列中与covid -19相关的特征。方法:我们使用诊断和实验室测量以及国家临床队列协作中的药物概念,为4个具有置信水平的队列创建计算表型。为了稳健性,我们进行了随机抽样、盲法临床医生注释来评估准确性。我们计算了4个队列中人口统计学、合并症和COVID-19变量的分布。结果:我们确定了132664名艾滋病毒感染者,其中36088人使用PrEP, 4120人使用PEP, 20639675人没有感染艾滋病毒。大多数艾滋病毒感染者是通过综合医疗条件、实验室测量和药物暴露来确定的(74,809/132,664,56.4%),其次是实验室测量和药物暴露(15,241/132,664,11.5%),然后是医疗条件和药物暴露(14,595/132,664,11%)。与其他队列相比,艾滋病毒感染者经历与covid -19相关的住院(4650人,132664人,3.5%)或死亡率(828人/ 132664人,0.6%)和全因死亡率(2083人/ 132664人,1.6%)的比例更高。结论:使用广泛的表现型算法,利用EHR存储库中的颗粒数据,我们已经确定了艾滋病毒感染者,非艾滋病毒感染者,PrEP使用者和PEP使用者。我们的研究结果为优化这些人群未来的电子病历表型提供了可转移的经验教训。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Identifying People Living With or Those at Risk for HIV in a Nationally Sampled Electronic Health Record Repository Called the National Clinical Cohort Collaborative: Computational Phenotyping Study.

Background: Electronic health records (EHRs) provide valuable insights to address clinical and epidemiological research concerning HIV, including the disproportionate impact of the COVID-19 pandemic on people living with HIV. To identify this population, most studies using EHR or claims databases start with diagnostic codes, which can result in misclassification without further refinement using drug or laboratory data. Furthermore, given that antiretrovirals now have indications for both HIV and COVID-19 (ie, ritonavir in nirmatrelvir/ritonavir), new phenotyping methods are needed to better capture people living with HIV. Therefore, we created a generalizable and innovative method to robustly identify people living with HIV, preexposure prophylaxis (PrEP) users, postexposure prophylaxis (PEP) users, and people not living with HIV using granular clinical data after the emergence of COVID-19.

Objective: The primary aim of this study was to use computational phenotyping in EHR data to identify people living with HIV (cohort 1), PrEP users (cohort 2), PEP users (cohort 3), or "none of the above" (people not living with HIV; cohort 4) and describe COVID-19-related characteristics among these cohorts.

Methods: We used diagnostic and laboratory measurements and drug concepts in the National Clinical Cohort Collaborative to create a computational phenotype for the 4 cohorts with confidence levels. For robustness, we conducted a randomly sampled, blinded clinician annotation to assess precision. We calculated the distribution of demographics, comorbidities, and COVID-19 variables among the 4 cohorts.

Results: We identified 132,664 people living with HIV with a high level of confidence, 36,088 PrEP users, 4120 PEP users, and 20,639,675 people not living with HIV. Most people living with HIV were identified by a combination of medical conditions, laboratory measurements, and drug exposures (74,809/132,664, 56.4%), followed by laboratory measurements and drug exposures (15,241/132,664, 11.5%) and then by medical conditions and drug exposures (14,595/132,664, 11%). A higher proportion of people living with HIV experienced COVID-19-related hospitalization (4650,132,664, 3.5%) or mortality (828/132,664, 0.6%) and all-cause mortality (2083/132,664, 1.6%) compared to other cohorts.

Conclusions: Using an extensive phenotyping algorithm leveraging granular data in an EHR repository, we have identified people living with HIV, people not living with HIV, PrEP users, and PEP users. Our findings offer transferable lessons to optimize future EHR phenotyping for these cohorts.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
JMIR Medical Informatics
JMIR Medical Informatics Medicine-Health Informatics
CiteScore
7.90
自引率
3.10%
发文量
173
审稿时长
12 weeks
期刊介绍: JMIR Medical Informatics (JMI, ISSN 2291-9694) is a top-rated, tier A journal which focuses on clinical informatics, big data in health and health care, decision support for health professionals, electronic health records, ehealth infrastructures and implementation. It has a focus on applied, translational research, with a broad readership including clinicians, CIOs, engineers, industry and health informatics professionals. Published by JMIR Publications, publisher of the Journal of Medical Internet Research (JMIR), the leading eHealth/mHealth journal (Impact Factor 2016: 5.175), JMIR Med Inform has a slightly different scope (emphasizing more on applications for clinicians and health professionals rather than consumers/citizens, which is the focus of JMIR), publishes even faster, and also allows papers which are more technical or more formative than what would be published in the Journal of Medical Internet Research.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信