Rule-based natural language processing to examine variation in worsening heart failure hospitalizations by age, sex, race and ethnicity, and left ventricular ejection fraction
Matthew T. Mefford PhD, Andrew P. Ambrosy MD, Rong Wei MS, Chengyi Zheng PhD, Rishi V. Parikh MPH, Teresa N. Harrison SM, Ming-Sum Lee MD, Alan S. Go MD, Kristi Reynolds PhD
American Heart Journal, volume 278, pages 117-126. Journal article, published 2024-09-07. DOI: 10.1016/j.ahj.2024.09.001
https://www.sciencedirect.com/science/article/pii/S0002870324002345
Abstract
Background
Prior studies characterizing worsening heart failure events (WHFE) have largely been limited to structured healthcare data from hospitalizations and have rarely explored sociodemographic variation. The current study examined the impact of incorporating unstructured data to identify WHFE and described age-, sex-, race and ethnicity-, and left ventricular ejection fraction (LVEF)-specific rates.
Methods
Adult members of Kaiser Permanente Southern California (KPSC) with a heart failure (HF) diagnosis between 2014 and 2018 were followed through 2019 to identify hospitalized WHFE. The main outcome was hospitalizations with a principal or secondary HF discharge diagnosis that met rule-based Natural Language Processing (NLP) criteria for WHFE. For comparison, we examined hospitalizations with a principal discharge diagnosis of HF. Age-, sex-, and race and ethnicity-adjusted rates per 100 person-years (PY) were calculated within age, sex, race and ethnicity (non-Hispanic [NH] Asian/Pacific Islander [API], Hispanic, NH Black, NH White), and LVEF subgroups.
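As a reading aid, a rate per 100 PY and an adjusted rate can be sketched as follows. The abstract does not state which adjustment method was used; direct standardization is assumed here purely for illustration. The `Stratum` class, both function names, and all numbers in the example are hypothetical and are not taken from the study.

```python
from dataclasses import dataclass


@dataclass
class Stratum:
    """One adjustment stratum (hypothetical, e.g. an age-by-sex cell)."""
    events: int          # hospitalized WHFE counted in this stratum
    person_years: float  # follow-up time contributed by this stratum
    weight: float        # share of the standard population in this stratum


def rate_per_100_py(events: int, person_years: float) -> float:
    """Crude event rate expressed per 100 person-years."""
    if person_years <= 0:
        raise ValueError("person_years must be positive")
    return 100.0 * events / person_years


def directly_standardized_rate(strata: list[Stratum]) -> float:
    """Weighted average of stratum-specific rates (direct standardization).

    This is an assumed method, not necessarily the one used in the study.
    """
    total_weight = sum(s.weight for s in strata)
    weighted = sum(
        s.weight * rate_per_100_py(s.events, s.person_years) for s in strata
    )
    return weighted / total_weight


# Made-up numbers, chosen only to show the mechanics.
strata = [
    Stratum(events=30, person_years=400.0, weight=0.5),
    Stratum(events=90, person_years=600.0, weight=0.5),
]
crude = rate_per_100_py(120, 1000.0)
adjusted = directly_standardized_rate(strata)
print(f"crude: {crude:.2f} per 100 PY, adjusted: {adjusted:.2f} per 100 PY")
```

In this toy example the crude and adjusted rates differ because the second stratum contributes more follow-up time than its weight in the assumed standard population.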
Results
Among 44,863 adults with HF, 10,560 (23.5%) had an NLP-defined, hospitalized WHFE. Adjusted rates (per 100 PY) of WHFE identified using NLP were higher than rates based only on HF principal discharge diagnosis codes (12.7 and 9.3, respectively). The same pattern held across subgroups, with the highest rates among adults ≥75 years (16.3 and 11.2), men (13.2 and 9.7), NH Black (16.9 and 14.3) and Hispanic (15.3 and 11.4) adults, and adults with reduced LVEF (16.2 and 14.0). Using NLP disproportionately increased the perceived burden of WHFE among API adults and adults with mid-range or preserved LVEF.
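The reported figures can be put side by side with a short sketch that computes, for each subgroup quoted in the abstract, the absolute difference and the ratio between the NLP-based and code-based rates. The rate pairs and the two counts are copied from the text above; the variable names and the `compare` helper are illustrative only.

```python
# (NLP-based rate, principal-diagnosis-code rate), per 100 PY, as reported.
rates = {
    "Overall": (12.7, 9.3),
    "Age >=75 years": (16.3, 11.2),
    "Men": (13.2, 9.7),
    "NH Black": (16.9, 14.3),
    "Hispanic": (15.3, 11.4),
    "Reduced LVEF": (16.2, 14.0),
}


def compare(nlp_rate: float, code_rate: float) -> tuple[float, float]:
    """Return (absolute difference, rate ratio) of NLP vs. code-based rates."""
    return nlp_rate - code_rate, nlp_rate / code_rate


summary = {group: compare(*pair) for group, pair in rates.items()}

# Share of the cohort with at least one NLP-defined hospitalized WHFE.
proportion_with_whfe = 10_560 / 44_863

for group, (difference, ratio) in summary.items():
    print(f"{group:<16} difference {difference:4.1f}  ratio {ratio:.2f}")
print(f"Cohort share with WHFE: {proportion_with_whfe:.1%}")
```

The abstract does not report rates for API adults or for adults with mid-range or preserved LVEF, so those subgroups are left out rather than guessed.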
Conclusion
Rule-based NLP captured more hospitalized WHFE than principal discharge diagnosis codes alone. Applying standardized consensus definitions to electronic health record (EHR) data may improve understanding of the burden of WHFE and promote optimal care overall and in specific sociodemographic groups.
About the journal:
The American Heart Journal will consider for publication suitable articles on topics pertaining to the broad discipline of cardiovascular disease. Our goal is to provide the reader primary investigation, scholarly review, and opinion concerning the practice of cardiovascular medicine. We especially encourage submission of 3 types of reports that are not frequently seen in cardiovascular journals: negative clinical studies, reports on study designs, and studies involving the organization of medical care. The Journal does not accept individual case reports or original articles involving bench laboratory or animal research.