Leveraging Natural Language Processing to Augment Structured Social Determinants of Health Data in the Electronic Health Record

K. Lybarger, Nicholas J. Dobbins, Ritche Long, Angad Singh, Patrick Wedgeworth, Özlem Ozuner, Meliha Yetisgen-Yildiz
{"title":"Leveraging Natural Language Processing to Augment Structured Social Determinants of Health Data in the Electronic Health Record","authors":"K. Lybarger, Nicholas J. Dobbins, Ritche Long, Angad Singh, Patrick Wedgeworth, Özlem Ozuner, Meliha Yetisgen-Yildiz","doi":"10.48550/arXiv.2212.07538","DOIUrl":null,"url":null,"abstract":"OBJECTIVE\nSocial determinants of health (SDOH) impact health outcomes and are documented in the electronic health record (EHR) through structured data and unstructured clinical notes. However, clinical notes often contain more comprehensive SDOH information, detailing aspects such as status, severity, and temporality. This work has two primary objectives: (1) develop a natural language processing information extraction model to capture detailed SDOH information and (2) evaluate the information gain achieved by applying the SDOH extractor to clinical narratives and combining the extracted representations with existing structured data.\n\n\nMATERIALS AND METHODS\nWe developed a novel SDOH extractor using a deep learning entity and relation extraction architecture to characterize SDOH across various dimensions. In an EHR case study, we applied the SDOH extractor to a large clinical data set with 225 089 patients and 430 406 notes with social history sections and compared the extracted SDOH information with existing structured data.\n\n\nRESULTS\nThe SDOH extractor achieved 0.86 F1 on a withheld test set. In the EHR case study, we found extracted SDOH information complements existing structured data with 32% of homeless patients, 19% of current tobacco users, and 10% of drug users only having these health risk factors documented in the clinical narrative.\n\n\nCONCLUSIONS\nUtilizing EHR data to identify SDOH health risk factors and social needs may improve patient care and outcomes. Semantic representations of text-encoded SDOH information can augment existing structured data, and this more comprehensive SDOH representation can assist health systems in identifying and addressing these social needs.","PeriodicalId":236137,"journal":{"name":"Journal of the American Medical Informatics Association : JAMIA","volume":"165 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association : JAMIA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2212.07538","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6

Abstract

OBJECTIVE Social determinants of health (SDOH) impact health outcomes and are documented in the electronic health record (EHR) through structured data and unstructured clinical notes. However, clinical notes often contain more comprehensive SDOH information, detailing aspects such as status, severity, and temporality. This work has two primary objectives: (1) develop a natural language processing information extraction model to capture detailed SDOH information and (2) evaluate the information gain achieved by applying the SDOH extractor to clinical narratives and combining the extracted representations with existing structured data. MATERIALS AND METHODS We developed a novel SDOH extractor using a deep learning entity and relation extraction architecture to characterize SDOH across various dimensions. In an EHR case study, we applied the SDOH extractor to a large clinical data set with 225 089 patients and 430 406 notes with social history sections and compared the extracted SDOH information with existing structured data. RESULTS The SDOH extractor achieved 0.86 F1 on a withheld test set. In the EHR case study, we found extracted SDOH information complements existing structured data with 32% of homeless patients, 19% of current tobacco users, and 10% of drug users only having these health risk factors documented in the clinical narrative. CONCLUSIONS Utilizing EHR data to identify SDOH health risk factors and social needs may improve patient care and outcomes. Semantic representations of text-encoded SDOH information can augment existing structured data, and this more comprehensive SDOH representation can assist health systems in identifying and addressing these social needs.
利用自然语言处理来增强电子健康记录中健康数据的结构化社会决定因素
健康的社会决定因素(SDOH)影响健康结果,并通过结构化数据和非结构化临床记录记录在电子健康记录(EHR)中。然而,临床记录通常包含更全面的SDOH信息,详细说明状态、严重程度和时间等方面。这项工作有两个主要目标:(1)开发一个自然语言处理信息提取模型来捕获详细的SDOH信息;(2)评估将SDOH提取器应用于临床叙述并将提取的表征与现有结构化数据相结合所获得的信息增益。材料和方法我们开发了一种新的SDOH提取器,使用深度学习实体和关系提取架构来描述各个维度的SDOH。在EHR案例研究中,我们将SDOH提取器应用于一个包含225089例患者和430406份带有社会历史部分的笔记的大型临床数据集,并将提取的SDOH信息与现有结构化数据进行比较。结果在保留的测试集上,SDOH提取器达到了0.86 F1。在电子病历案例研究中,我们发现提取的SDOH信息补充了现有的结构化数据,32%的无家可归患者、19%的当前烟草使用者和10%的吸毒者只有这些健康风险因素记录在临床叙述中。结论利用电子病历数据识别SDOH健康危险因素和社会需求可以改善患者的护理和预后。文本编码的SDOH信息的语义表示可以增强现有的结构化数据,这种更全面的SDOH表示可以帮助卫生系统识别和解决这些社会需求。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信