Mengying Yan, Hwanhee Hong, Jonathan Wilson, Benjamin A Goldstein
{"title":"Estimating the Observability of an Outcome from an Electronic Health Records Dataset Using External Data.","authors":"Mengying Yan, Hwanhee Hong, Jonathan Wilson, Benjamin A Goldstein","doi":"10.1093/aje/kwaf013","DOIUrl":null,"url":null,"abstract":"<p><p>One of the key limitations of electronic health records (EHR) data is that not all health care encounters are observed. The degree to which patient information is captured is referred to as observability. Poor observability, and in particular differential observability, can lead to biased estimates and inference. As such, understanding the degree of observability is important in EHR based studies. In this study, we propose using external data with known observability to assess the degree of overall observability in EHR. We also construct a test for differential observability in the target EHR dataset. Using principles from the transportability literature, we show that we can use a balancing score based weight to estimate the observability of our target outcome. We conduct a series of simulation experiments to understand the conditions under which dataset features must be required to obtain proper inference. To illustrate this, we consider hospital readmissions among patients with end stage renal disease as our outcome of interest. We use administrative claims data, where the outcome is fully observed, as our external data.</p>","PeriodicalId":7472,"journal":{"name":"American journal of epidemiology","volume":" ","pages":""},"PeriodicalIF":5.0000,"publicationDate":"2025-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"American journal of epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1093/aje/kwaf013","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
One of the key limitations of electronic health records (EHR) data is that not all health care encounters are observed. The degree to which patient information is captured is referred to as observability. Poor observability, and in particular differential observability, can lead to biased estimates and inference. As such, understanding the degree of observability is important in EHR based studies. In this study, we propose using external data with known observability to assess the degree of overall observability in EHR. We also construct a test for differential observability in the target EHR dataset. Using principles from the transportability literature, we show that we can use a balancing score based weight to estimate the observability of our target outcome. We conduct a series of simulation experiments to understand the conditions under which dataset features must be required to obtain proper inference. To illustrate this, we consider hospital readmissions among patients with end stage renal disease as our outcome of interest. We use administrative claims data, where the outcome is fully observed, as our external data.
期刊介绍:
The American Journal of Epidemiology is the oldest and one of the premier epidemiologic journals devoted to the publication of empirical research findings, opinion pieces, and methodological developments in the field of epidemiologic research.
It is a peer-reviewed journal aimed at both fellow epidemiologists and those who use epidemiologic data, including public health workers and clinicians.