Automatic information extraction from patient records in Bulgarian language

G. Angelova
{"title":"Automatic information extraction from patient records in Bulgarian language","authors":"G. Angelova","doi":"10.1145/2516775.2516777","DOIUrl":null,"url":null,"abstract":"Natural Language Processing (NLP) has been viewed as a promising technology in medical informatics since decades. Despite the gradually improving quality of automatic text analysis, however, clinical NLP systems are still rarely used outside the research Labs due to the following reasons: (i) their development is very expensive so most of them are prototypes or proof-of-concept demonstrators, (ii) real exploitation of NLP modules would require constant support of the underlying linguistic resources and tuning the systems to new text types; (iii) the technology has potentially high accuracy but some results might be erroneous and misleading [1]. On the other hand, the quick adoption of Electronic Health Records worldwide implies constant growth of electronic narratives discussing patient-related information. According to the established medical practices, the most important findings about the patients are still kept as free texts in various documents and languages. In this way the so called Information Extraction (IE) becomes the dominating language technology that is currently applied to biomedical texts. The main idea is to extract automatically important entities, with accuracy as high as possible, and to operate on these entities skipping the remaining text fragments. IE is based on shallow analysis only but it is expected that even the progress in partial text understanding would enable radical improvements in clinical decision support, biomedical research and healthcare in general.","PeriodicalId":316788,"journal":{"name":"International Conference on Computer Systems and Technologies","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2013-06-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Computer Systems and Technologies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2516775.2516777","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Natural Language Processing (NLP) has been viewed as a promising technology in medical informatics since decades. Despite the gradually improving quality of automatic text analysis, however, clinical NLP systems are still rarely used outside the research Labs due to the following reasons: (i) their development is very expensive so most of them are prototypes or proof-of-concept demonstrators, (ii) real exploitation of NLP modules would require constant support of the underlying linguistic resources and tuning the systems to new text types; (iii) the technology has potentially high accuracy but some results might be erroneous and misleading [1]. On the other hand, the quick adoption of Electronic Health Records worldwide implies constant growth of electronic narratives discussing patient-related information. According to the established medical practices, the most important findings about the patients are still kept as free texts in various documents and languages. In this way the so called Information Extraction (IE) becomes the dominating language technology that is currently applied to biomedical texts. The main idea is to extract automatically important entities, with accuracy as high as possible, and to operate on these entities skipping the remaining text fragments. IE is based on shallow analysis only but it is expected that even the progress in partial text understanding would enable radical improvements in clinical decision support, biomedical research and healthcare in general.
保加利亚语患者记录自动信息提取
自然语言处理(NLP)在医学信息学领域一直被视为一种很有前途的技术。尽管自动文本分析的质量逐渐提高,但临床NLP系统仍然很少在研究实验室之外使用,原因如下:(i)它们的开发非常昂贵,因此大多数是原型或概念验证演示,(ii)真正利用NLP模块将需要不断支持底层语言资源并调整系统以适应新的文本类型;(iii)该技术具有潜在的高准确性,但某些结果可能是错误的和误导性的。另一方面,电子健康记录在世界范围内的迅速采用意味着讨论患者相关信息的电子叙述的不断增长。根据既定的医疗惯例,关于病人的最重要的发现仍然以各种文件和语言作为免费文本保存。通过这种方式,所谓的信息提取(IE)成为目前应用于生物医学文本的主导语言技术。其主要思想是以尽可能高的准确性自动提取重要实体,并对这些实体进行操作,跳过剩余的文本片段。IE仅基于肤浅的分析,但预计即使是部分文本理解方面的进展也将使临床决策支持、生物医学研究和一般医疗保健方面的根本改进成为可能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信