Lourdes A Valdez, Edgar Javier Hernandez, O'Connor Matthews, Matthew Mulvey, Hillary Crandall, Karen Eilbeck
Probabilistic Graphical Models for Evaluating the Utility of Data-Driven ICD Code Categories in Pediatric Sepsis.
AMIA ... Annual Symposium proceedings. AMIA Symposium, volume 2024, pages 1149-1158. Journal Article, eCollection 2024-01-01; published 2025-05-22.
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12099341/pdf/
Citation count: 0
Abstract
Electronic health records (EHRs) are information systems designed to collect and manage clinical data in support of clinical activities. They have emerged as valuable sources of data for outcomes research, offering vast repositories of patient information for analysis. Definitions of pediatric sepsis are ambiguous, which delays diagnosis and treatment and highlights the need for precise and efficient patient categorization techniques. Using EHRs in research nevertheless poses challenges. Although EHRs were originally created to document patient encounters, medical coding was designed to satisfy billing requirements. As a result, EHR data may lack granularity, potentially leading to misclassification and incomplete representation of patient conditions. We compared data-driven ICD code categories with chart review using probabilistic graphical models (PGMs), chosen for their ability to handle uncertainty and incorporate prior knowledge. Overall, this paper demonstrates the potential of PGMs to address these challenges and improve the analysis of ICD codes for sepsis outcomes research.
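
To make the idea of comparing an ICD code category against chart review with a PGM concrete, the following is a minimal sketch of the smallest possible model: a two-node network in which chart-review sepsis status influences whether a code category is present. It is not the authors' model. The function names, the smoothing pseudo-count, and every number are hypothetical and chosen only for illustration.

```python
def estimate_cpt(pairs, pseudo=1.0):
    """Estimate P(code=1 | sepsis=s) from (sepsis, code) pairs.

    `pseudo` is a hypothetical Beta(pseudo, pseudo) pseudo-count that
    stands in for prior knowledge; it is an assumption of this sketch.
    """
    counts = {0: [0, 0], 1: [0, 0]}  # counts[sepsis][code]
    for sepsis, code in pairs:
        counts[sepsis][code] += 1
    return {
        s: (counts[s][1] + pseudo) / (counts[s][0] + counts[s][1] + 2 * pseudo)
        for s in (0, 1)
    }


def posterior_sepsis(prior, p_code_given_sepsis, p_code_given_no_sepsis):
    """P(sepsis=1 | code=1) for the network Sepsis -> Code, via Bayes' rule."""
    joint_sepsis = prior * p_code_given_sepsis
    joint_no_sepsis = (1.0 - prior) * p_code_given_no_sepsis
    evidence = joint_sepsis + joint_no_sepsis
    if evidence == 0:
        raise ValueError("code category has zero probability under the model")
    return joint_sepsis / evidence


# Illustrative, made-up labels: (chart-review sepsis, ICD category present).
toy_pairs = [(1, 1), (1, 1), (1, 0), (0, 0), (0, 0), (0, 1)]
cpt = estimate_cpt(toy_pairs)

# Hypothetical prior prevalence of 10%.
print(posterior_sepsis(0.10, cpt[1], cpt[0]))
```

A posterior close to the prior would suggest that the code category carries little information about chart-review sepsis status, while a posterior far above the prior would suggest that the category is useful for cohort identification.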