Shantanu Dev BS , Andrew Zolensky BS , Hanaa Dakour Aridi MD , Catherine Kelty PhD, MS , Mackenzie K. Madison MD, MS , Anush Motaganahalli MPH , Benjamin S. Brooke MD, PhD, FACS , Brian Dixon PhD, MPA , Malaz Boustani MD, MPH , Zina Ben Miled PhD , Ping Zhang PhD , Andrew A. Gonzalez MD, JD, MPH, FACS
{"title":"Use of Deep Learning to Identify Peripheral Arterial Disease Cases From Narrative Clinical Notes","authors":"Shantanu Dev BS , Andrew Zolensky BS , Hanaa Dakour Aridi MD , Catherine Kelty PhD, MS , Mackenzie K. Madison MD, MS , Anush Motaganahalli MPH , Benjamin S. Brooke MD, PhD, FACS , Brian Dixon PhD, MPA , Malaz Boustani MD, MPH , Zina Ben Miled PhD , Ping Zhang PhD , Andrew A. Gonzalez MD, JD, MPH, FACS","doi":"10.1016/j.jss.2024.09.062","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><div>Peripheral arterial disease (PAD) is the leading cause of amputation in the United States. Despite affecting 8.5 million Americans and more than 200 million people globally, there are significant gaps in awareness by both patients and providers. Ongoing efforts to raise PAD awareness among both the public and health-care professionals have not met widespread success. Thus, there is a need for alternative methods for identifying PAD patients. One potentially promising strategy leverages natural language processing (NLP) to digitally screen patients for PAD. Prior approaches have applied keyword search (KWS) to billing codes or unstructured clinical narratives to identify patients with PAD. However, KWS is limited by its lack of flexibility, the need for manual algorithm development, inconsistent validation, and an inherent failure to capture patients with undiagnosed PAD. Recent advances in deep learning (DL) allow modern NLP models to learn a conceptual representation of the verbiage associated with PAD. This capability may overcome the characteristic constraints of applying strict rule-based algorithms (i.e., searching for a disease-defining set of keywords or billing codes) to real-world clinical data. Herein, we investigate the use of DL to identify patients with PAD from unstructured notes in the electronic health record (EHR).</div></div><div><h3>Methods</h3><div>Using EHR data from a statewide health information exchange, we first created a dataset of all patients with diagnostic or procedural codes (International Classification of Diseases version 9 or 10 or Current Procedural Terminology) for PAD. This study population was then subdivided into training (70%) and testing (30%) cohorts. We based ground truth labels (PAD <em>versus</em> no PAD) on the presence of a primary diagnostic or procedural billing code for PAD at the encounter level. We implemented our KWS-based identification strategy using the currently published state-of-the-art algorithm for identifying PAD cases from unstructured EHR data. We developed a DL model using a BioMed-RoBERTa base that was fine-tuned on the training cohort. We compared the performance of the KWS algorithm to our DL model on a binary classification task (PAD <em>versus</em> no PAD).</div></div><div><h3>Results</h3><div>Our study included 484,363 encounters across 71,355 patients represented in 2,268,062 notes. For the task of correctly identifying PAD related notes in our testing set, the DL outperformed KWS on all model performance measures (Sens 0.70 <em>versus</em> 0.62; Spec 0.99 <em>versus</em> 0.94; PPV 0.82 <em>versus</em> 0.69; NPV 0.97 <em>versus</em> 0.96; Accuracy 0.96 <em>versus</em> 0.91; <em>P</em> value for all comparisons <0.001).</div></div><div><h3>Conclusions</h3><div>Our findings suggest that DL outperforms KWS for identifying PAD cases from clinical narratives. Future planned work derived from this project will develop models to stage patients based on clinical scoring systems.</div></div>","PeriodicalId":17030,"journal":{"name":"Journal of Surgical Research","volume":"303 ","pages":"Pages 699-708"},"PeriodicalIF":1.8000,"publicationDate":"2024-10-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Surgical Research","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0022480424006103","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SURGERY","Score":null,"Total":0}
引用次数: 0
Abstract
Introduction
Peripheral arterial disease (PAD) is the leading cause of amputation in the United States. Despite affecting 8.5 million Americans and more than 200 million people globally, there are significant gaps in awareness by both patients and providers. Ongoing efforts to raise PAD awareness among both the public and health-care professionals have not met widespread success. Thus, there is a need for alternative methods for identifying PAD patients. One potentially promising strategy leverages natural language processing (NLP) to digitally screen patients for PAD. Prior approaches have applied keyword search (KWS) to billing codes or unstructured clinical narratives to identify patients with PAD. However, KWS is limited by its lack of flexibility, the need for manual algorithm development, inconsistent validation, and an inherent failure to capture patients with undiagnosed PAD. Recent advances in deep learning (DL) allow modern NLP models to learn a conceptual representation of the verbiage associated with PAD. This capability may overcome the characteristic constraints of applying strict rule-based algorithms (i.e., searching for a disease-defining set of keywords or billing codes) to real-world clinical data. Herein, we investigate the use of DL to identify patients with PAD from unstructured notes in the electronic health record (EHR).
Methods
Using EHR data from a statewide health information exchange, we first created a dataset of all patients with diagnostic or procedural codes (International Classification of Diseases version 9 or 10 or Current Procedural Terminology) for PAD. This study population was then subdivided into training (70%) and testing (30%) cohorts. We based ground truth labels (PAD versus no PAD) on the presence of a primary diagnostic or procedural billing code for PAD at the encounter level. We implemented our KWS-based identification strategy using the currently published state-of-the-art algorithm for identifying PAD cases from unstructured EHR data. We developed a DL model using a BioMed-RoBERTa base that was fine-tuned on the training cohort. We compared the performance of the KWS algorithm to our DL model on a binary classification task (PAD versus no PAD).
Results
Our study included 484,363 encounters across 71,355 patients represented in 2,268,062 notes. For the task of correctly identifying PAD related notes in our testing set, the DL outperformed KWS on all model performance measures (Sens 0.70 versus 0.62; Spec 0.99 versus 0.94; PPV 0.82 versus 0.69; NPV 0.97 versus 0.96; Accuracy 0.96 versus 0.91; P value for all comparisons <0.001).
Conclusions
Our findings suggest that DL outperforms KWS for identifying PAD cases from clinical narratives. Future planned work derived from this project will develop models to stage patients based on clinical scoring systems.
期刊介绍:
The Journal of Surgical Research: Clinical and Laboratory Investigation publishes original articles concerned with clinical and laboratory investigations relevant to surgical practice and teaching. The journal emphasizes reports of clinical investigations or fundamental research bearing directly on surgical management that will be of general interest to a broad range of surgeons and surgical researchers. The articles presented need not have been the products of surgeons or of surgical laboratories.
The Journal of Surgical Research also features review articles and special articles relating to educational, research, or social issues of interest to the academic surgical community.