Improving the phenotype risk score as a scalable approach to identifying patients with Mendelian disease

Journal of the American Medical Informatics Association : JAMIA Pub Date : 2019-10-14 DOI:10.1093/jamia/ocz179

L. Bastarache, J. Hughey, J. Goldstein, Julie A Bastraache, Satya N. Das, Neil Zaki, Chenjie Zeng, Leigh Anne Tang, D. Roden, J. Denny

{"title":"Improving the phenotype risk score as a scalable approach to identifying patients with Mendelian disease","authors":"L. Bastarache, J. Hughey, J. Goldstein, Julie A Bastraache, Satya N. Das, Neil Zaki, Chenjie Zeng, Leigh Anne Tang, D. Roden, J. Denny","doi":"10.1093/jamia/ocz179","DOIUrl":null,"url":null,"abstract":"OBJECTIVE\nThe Phenotype Risk Score (PheRS) is a method to detect Mendelian disease patterns using phenotypes from the electronic health record (EHR). We compared the performance of different approaches mapping EHR phenotypes to Mendelian disease features.\n\n\nMATERIALS AND METHODS\nPheRS utilizes Mendelian diseases descriptions annotated with Human Phenotype Ontology (HPO) terms. In previous work, we presented a map linking phecodes (based on International Classification of Diseases [ICD]-Ninth Revision) to HPO terms. For this study, we integrated ICD-Tenth Revision codes and lab data. We also created a new map between HPO terms using customized groupings of ICD codes. We compared the performance with cases and controls for 16 Mendelian diseases using 2.5 million de-identified medical records.\n\n\nRESULTS\nPheRS effectively distinguished cases from controls for all 15 positive controls and all approaches tested (P < 4 × 1016). Adding lab data led to a statistically significant improvement for 4 of 14 diseases. The custom ICD groupings improved specificity, leading to an average 8% increase for precision at 100 (-2% to 22%). Eight of 10 adults with cystic fibrosis tested had PheRS in the 95th percentile prio to diagnosis.\n\n\nDISCUSSION\nBoth phecodes and custom ICD groupings were able to detect differences between affected cases and controls at the population level. The ICD map showed better precision for the highest scoring individuals. Adding lab data improved performance at detecting population-level differences.\n\n\nCONCLUSIONS\nPheRS is a scalable method to study Mendelian disease at the population level using electronic health record data and can potentially be used to find patients with undiagnosed Mendelian disease.","PeriodicalId":236137,"journal":{"name":"Journal of the American Medical Informatics Association : JAMIA","volume":"123 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-10-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"28","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association : JAMIA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jamia/ocz179","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 28

Abstract

OBJECTIVE The Phenotype Risk Score (PheRS) is a method to detect Mendelian disease patterns using phenotypes from the electronic health record (EHR). We compared the performance of different approaches mapping EHR phenotypes to Mendelian disease features. MATERIALS AND METHODS PheRS utilizes Mendelian diseases descriptions annotated with Human Phenotype Ontology (HPO) terms. In previous work, we presented a map linking phecodes (based on International Classification of Diseases [ICD]-Ninth Revision) to HPO terms. For this study, we integrated ICD-Tenth Revision codes and lab data. We also created a new map between HPO terms using customized groupings of ICD codes. We compared the performance with cases and controls for 16 Mendelian diseases using 2.5 million de-identified medical records. RESULTS PheRS effectively distinguished cases from controls for all 15 positive controls and all approaches tested (P < 4 × 1016). Adding lab data led to a statistically significant improvement for 4 of 14 diseases. The custom ICD groupings improved specificity, leading to an average 8% increase for precision at 100 (-2% to 22%). Eight of 10 adults with cystic fibrosis tested had PheRS in the 95th percentile prio to diagnosis. DISCUSSION Both phecodes and custom ICD groupings were able to detect differences between affected cases and controls at the population level. The ICD map showed better precision for the highest scoring individuals. Adding lab data improved performance at detecting population-level differences. CONCLUSIONS PheRS is a scalable method to study Mendelian disease at the population level using electronic health record data and can potentially be used to find patients with undiagnosed Mendelian disease.

查看原文本刊更多论文

改善表型风险评分作为一种可扩展的方法来识别孟德尔病患者

目的表型风险评分(PheRS)是一种利用电子健康记录(EHR)中的表型检测孟德尔疾病模式的方法。我们比较了将EHR表型映射到孟德尔疾病特征的不同方法的性能。材料和方法sphers采用孟德尔疾病描述，并附有人类表型本体论(HPO)术语注释。在之前的工作中，我们提出了一个链接密码(基于国际疾病分类[ICD]-第九版)到HPO术语的地图。在本研究中，我们整合了icd -第十版代码和实验室数据。我们还使用自定义的ICD代码分组在HPO术语之间创建了新的映射。我们使用250万份去识别的医疗记录，将16种孟德尔疾病的病例和对照组的表现进行了比较。结果在所有15个阳性对照和所有检测方法中，sphers都能有效地将病例与对照组区分开来(P < 4 × 1016)。添加实验室数据导致14种疾病中的4种有统计学上的显著改善。自定义ICD分组提高了特异性，导致100的精确度平均提高8%(-2%至22%)。10名囊性纤维化患者中有8人在诊断前的PheRS处于第95百分位数。密码和自定义ICD分组都能够在人群水平上发现受影响病例和对照之间的差异。ICD地图对得分最高的个体显示出更高的准确性。添加实验室数据提高了检测种群水平差异的性能。结论sphers是一种在人群水平上利用电子病历数据研究孟德尔病的可扩展方法，可用于发现未确诊的孟德尔病患者。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the American Medical Informatics Association : JAMIA

自引率

0.00%

发文量