Optimizing Identification of People Living with HIV from Electronic Medical Records: Computable Phenotype Development and Validation.

IF 1.3 4区 医学 Q3 COMPUTER SCIENCE, INFORMATION SYSTEMS
Methods of Information in Medicine Pub Date : 2021-09-01 Epub Date: 2021-09-30 DOI:10.1055/s-0041-1735619
Yiyang Liu, Khairul A Siddiqi, Robert L Cook, Jiang Bian, Patrick J Squires, Elizabeth A Shenkman, Mattia Prosperi, Dushyantha T Jayaweera
{"title":"Optimizing Identification of People Living with HIV from Electronic Medical Records: Computable Phenotype Development and Validation.","authors":"Yiyang Liu, Khairul A Siddiqi, Robert L Cook, Jiang Bian, Patrick J Squires, Elizabeth A Shenkman, Mattia Prosperi, Dushyantha T Jayaweera","doi":"10.1055/s-0041-1735619","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Electronic health record (EHR)-based computable phenotype algorithms allow researchers to efficiently identify a large virtual cohort of Human Immunodeficiency Virus (HIV) patients. Built upon existing algorithms, we refined, improved, and validated an HIV phenotype algorithm using data from the OneFlorida Data Trust, a repository of linked claims data and EHRs from its clinical partners, which provide care to over 15 million patients across all 67 counties in Florida.</p><p><strong>Methods: </strong>Our computable phenotype examined information from multiple EHR domains, including clinical encounters with diagnoses, prescription medications, and laboratory tests. To identify an HIV case, the algorithm requires the patient to have at least one diagnostic code for HIV and meet one of the following criteria: have 1+ positive HIV laboratory, have been prescribed with HIV medications, or have 3+ visits with HIV diagnostic codes. The computable phenotype was validated against a subset of clinical notes.</p><p><strong>Results: </strong>Among the 15+ million patients from OneFlorida, we identified 61,313 patients with confirmed HIV diagnosis. Among them, 8.05% met all four inclusion criteria, 69.7% met the 3+ HIV encounters criteria in addition to having HIV diagnostic code, and 8.1% met all criteria except for having positive laboratories. Our algorithm achieved higher sensitivity (98.9%) and comparable specificity (97.6%) relative to existing algorithms (77-83% sensitivity, 86-100% specificity). The mean age of the sample was 42.7 years, 58% male, and about half were Black African American. Patients' average follow-up period (the time between the first and last encounter in the EHRs) was approximately 4.6 years. The median number of all encounters and HIV-related encounters were 79 and 21, respectively.</p><p><strong>Conclusion: </strong>By leveraging EHR data from multiple clinical partners and domains, with a considerably diverse population, our algorithm allows more flexible criteria for identifying patients with incomplete laboratory test results and medication prescribing history compared with prior studies.</p>","PeriodicalId":49822,"journal":{"name":"Methods of Information in Medicine","volume":"60 3-04","pages":"84-94"},"PeriodicalIF":1.3000,"publicationDate":"2021-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8672443/pdf/nihms-1761917.pdf","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Methods of Information in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1055/s-0041-1735619","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/9/30 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Electronic health record (EHR)-based computable phenotype algorithms allow researchers to efficiently identify a large virtual cohort of Human Immunodeficiency Virus (HIV) patients. Built upon existing algorithms, we refined, improved, and validated an HIV phenotype algorithm using data from the OneFlorida Data Trust, a repository of linked claims data and EHRs from its clinical partners, which provide care to over 15 million patients across all 67 counties in Florida.

Methods: Our computable phenotype examined information from multiple EHR domains, including clinical encounters with diagnoses, prescription medications, and laboratory tests. To identify an HIV case, the algorithm requires the patient to have at least one diagnostic code for HIV and meet one of the following criteria: have 1+ positive HIV laboratory, have been prescribed with HIV medications, or have 3+ visits with HIV diagnostic codes. The computable phenotype was validated against a subset of clinical notes.

Results: Among the 15+ million patients from OneFlorida, we identified 61,313 patients with confirmed HIV diagnosis. Among them, 8.05% met all four inclusion criteria, 69.7% met the 3+ HIV encounters criteria in addition to having HIV diagnostic code, and 8.1% met all criteria except for having positive laboratories. Our algorithm achieved higher sensitivity (98.9%) and comparable specificity (97.6%) relative to existing algorithms (77-83% sensitivity, 86-100% specificity). The mean age of the sample was 42.7 years, 58% male, and about half were Black African American. Patients' average follow-up period (the time between the first and last encounter in the EHRs) was approximately 4.6 years. The median number of all encounters and HIV-related encounters were 79 and 21, respectively.

Conclusion: By leveraging EHR data from multiple clinical partners and domains, with a considerably diverse population, our algorithm allows more flexible criteria for identifying patients with incomplete laboratory test results and medication prescribing history compared with prior studies.

Abstract Image

Abstract Image

从电子病历中优化艾滋病病毒感染者的识别:可计算表型的开发与验证。
背景:基于电子健康记录(EHR)的可计算表型算法使研究人员能够高效地识别大量人类免疫缺陷病毒(HIV)患者的虚拟队列。在现有算法的基础上,我们利用来自 OneFlorida Data Trust 的数据改进、完善并验证了 HIV 表型算法:我们的可计算表型检查了来自多个 EHR 领域的信息,包括临床诊断、处方药和实验室检测。要确定一个 HIV 病例,该算法要求患者至少有一个 HIV 诊断代码,并符合以下标准之一:1 次以上 HIV 实验室检测呈阳性、开过 HIV 药物处方或 3 次以上就诊有 HIV 诊断代码。根据临床记录子集对可计算表型进行了验证:在来自 OneFlorida 的 1500 多万名患者中,我们发现了 61,313 名确诊为 HIV 的患者。其中,8.05% 的患者符合所有四项纳入标准,69.7% 的患者除有 HIV 诊断代码外,还符合 3+ HIV 诊断标准,8.1% 的患者除实验室阳性外,符合所有标准。与现有算法(灵敏度 77-83%,特异性 86-100%)相比,我们的算法具有更高的灵敏度(98.9%)和相当的特异性(97.6%)。样本的平均年龄为 42.7 岁,58% 为男性,约半数为非洲裔美国黑人。患者的平均随访时间(电子病历中首次就诊与最后一次就诊之间的时间)约为 4.6 年。所有就诊次数和艾滋病相关就诊次数的中位数分别为 79 次和 21 次:通过利用来自多个临床合作伙伴和领域的电子病历数据以及相当多样化的人群,与之前的研究相比,我们的算法能够以更灵活的标准识别实验室检测结果和用药史不完整的患者。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Methods of Information in Medicine
Methods of Information in Medicine 医学-计算机:信息系统
CiteScore
3.70
自引率
11.80%
发文量
33
审稿时长
6-12 weeks
期刊介绍: Good medicine and good healthcare demand good information. Since the journal''s founding in 1962, Methods of Information in Medicine has stressed the methodology and scientific fundamentals of organizing, representing and analyzing data, information and knowledge in biomedicine and health care. Covering publications in the fields of biomedical and health informatics, medical biometry, and epidemiology, the journal publishes original papers, reviews, reports, opinion papers, editorials, and letters to the editor. From time to time, the journal publishes articles on particular focus themes as part of a journal''s issue.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信