Towards comprehensive longitudinal healthcare data capture

2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops Pub Date : 2012-10-04 DOI:10.1109/BIBMW.2012.6470310

Delroy Cameron, Varun Bhagwan, A. Sheth

{"title":"Towards comprehensive longitudinal healthcare data capture","authors":"Delroy Cameron, Varun Bhagwan, A. Sheth","doi":"10.1109/BIBMW.2012.6470310","DOIUrl":null,"url":null,"abstract":"The ability to connect the dots in structured background knowledge and also across scientific literature has been demonstrated as a critical aspect of knowledge discovery. It is not unreasonable therefore to expect that connecting-the-dots across massive amounts of healthcare data may also lead to new insights that could impact diagnosis, treatment and overall patient care. Of critical importance is the observation that while structured Electronic Medical Records (EMR) are useful sources of health information, it is often the unstructured clinical texts such as progress notes and discharge summaries that contain rich, updated and granular information. Hence, by coupling structured EMR data with data from unstructured clinical texts, more holistic patient records, needed for connecting the dots, can be obtained. Unfortunately, free-text progress notes are fraught with a lack of proper grammatical structure, and contain liberal use of jargon and abbreviations, together with frequent misspellings. While these notes still serve their intended purpose for medical care, automatically extracting semantic information from them is a complex task. Overcoming this complexity could mean that evidence-based support for structured EMR data using unstructured clinical texts, can be provided. In this work therefore, we explore a pattern-based approach for extracting Smoker Semantic Types (SST) from unstructured clinical notes, in order to enable evidence-based resolution of SSTs asserted in structured EMRs using SSTs extracted from unstructured clinical notes. Our findings support the notion that information present in unstructured clinical text can be used to complement structured healthcare data. This is a crucial observation towards creating comprehensive longitudinal patient models for connecting-the-dots and providing better overall patient care.","PeriodicalId":6392,"journal":{"name":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","volume":"92 1","pages":"240-247"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBMW.2012.6470310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

Abstract

The ability to connect the dots in structured background knowledge and also across scientific literature has been demonstrated as a critical aspect of knowledge discovery. It is not unreasonable therefore to expect that connecting-the-dots across massive amounts of healthcare data may also lead to new insights that could impact diagnosis, treatment and overall patient care. Of critical importance is the observation that while structured Electronic Medical Records (EMR) are useful sources of health information, it is often the unstructured clinical texts such as progress notes and discharge summaries that contain rich, updated and granular information. Hence, by coupling structured EMR data with data from unstructured clinical texts, more holistic patient records, needed for connecting the dots, can be obtained. Unfortunately, free-text progress notes are fraught with a lack of proper grammatical structure, and contain liberal use of jargon and abbreviations, together with frequent misspellings. While these notes still serve their intended purpose for medical care, automatically extracting semantic information from them is a complex task. Overcoming this complexity could mean that evidence-based support for structured EMR data using unstructured clinical texts, can be provided. In this work therefore, we explore a pattern-based approach for extracting Smoker Semantic Types (SST) from unstructured clinical notes, in order to enable evidence-based resolution of SSTs asserted in structured EMRs using SSTs extracted from unstructured clinical notes. Our findings support the notion that information present in unstructured clinical text can be used to complement structured healthcare data. This is a crucial observation towards creating comprehensive longitudinal patient models for connecting-the-dots and providing better overall patient care.

查看原文本刊更多论文

实现全面的纵向医疗保健数据捕获

连接结构化背景知识和科学文献中的点的能力已被证明是知识发现的一个关键方面。因此，期望将大量医疗保健数据中的点连接起来也可能产生新的见解，从而影响诊断、治疗和整体患者护理，这并非不合理。至关重要的是，虽然结构化电子医疗记录(EMR)是有用的健康信息来源，但通常是非结构化的临床文本，如进度记录和出院摘要，包含丰富的、更新的和细粒度的信息。因此，通过将结构化EMR数据与非结构化临床文本的数据相结合，可以获得连接各个点所需的更全面的患者记录。不幸的是，自由文本进度笔记充满了缺乏适当的语法结构，并且包含大量使用术语和缩写，以及频繁的拼写错误。虽然这些笔记仍然服务于医疗保健的预期目的，但自动从中提取语义信息是一项复杂的任务。克服这种复杂性可能意味着可以使用非结构化临床文本为结构化电子病历数据提供循证支持。因此，在这项工作中，我们探索了一种基于模式的方法，用于从非结构化临床记录中提取吸烟者语义类型(SST)，以便使用从非结构化临床记录中提取的SST来实现结构化电子病历中断言的SST的循证解决。我们的研究结果支持这样一种观点，即非结构化临床文本中的信息可以用来补充结构化医疗数据。这是一个重要的观察结果，有助于创建全面的纵向患者模型，将各个点连接起来，提供更好的整体患者护理。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2012 IEEE International Conference on Bioinformatics and Biomedicine Workshops

自引率

0.00%

发文量