Structuring unstructured clinical narratives in OpenMRS with medical concept extraction

R. Eshleman, Hui Yang, Barry Levine
{"title":"Structuring unstructured clinical narratives in OpenMRS with medical concept extraction","authors":"R. Eshleman, Hui Yang, Barry Levine","doi":"10.1109/BIBM.2015.7359782","DOIUrl":null,"url":null,"abstract":"We have developed a new software module for the open source Electronic Medical Record System OpenMRS to analyze unstructured clinical narratives. This module leverages Named Entity Recognition (NER) to deliver concise, semantic-type driven, interactive summaries of clinical notes. To this end, we performed an extensive empirical evaluation of four Named Entity Recognition (NER) systems using textual clinical narratives and full biomedical journal articles. The four NER systems under evaluation are MetaMap, cTAKES, BANNER. We studied several ensemble approaches built upon the above four NER systems to exploit their collaborative strengths. Evaluations are performed on the manually annotated patient discharge summaries from the Informatics for Integrating Biology and the Bedside group (I2B2) and the CRAFT dataset. The main results include (1) BANNER significantly outperforms the other three systems on the I2B2 dataset with F1 values in the range of .73-.89, in contrast to .28 - .60 of other systems; and (2) Surprisingly, an ensemble approach of BANNER with any combinations of the other three approaches tends to degrade the performance by .08 - .11 in F1 when evaluated on the I2B2 dataset. Based on the evaluation results, we have developed a BANNER-based NER module for OpenMRS to recognize semantic concepts including problems, tests, and treatments. This module works with OpenMRS versions 2.×. The user interface presents concise clinical notes summaries and allows the user to filter, search and view the context of the concepts. We have also developed a companion web application to retrain the BANNER model using data from OpenMRS. The module and source code are available at wiki.openmrs.org/display/docs/Visit+Notes+Analysis+Module.","PeriodicalId":186217,"journal":{"name":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BIBM.2015.7359782","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

We have developed a new software module for the open source Electronic Medical Record System OpenMRS to analyze unstructured clinical narratives. This module leverages Named Entity Recognition (NER) to deliver concise, semantic-type driven, interactive summaries of clinical notes. To this end, we performed an extensive empirical evaluation of four Named Entity Recognition (NER) systems using textual clinical narratives and full biomedical journal articles. The four NER systems under evaluation are MetaMap, cTAKES, BANNER. We studied several ensemble approaches built upon the above four NER systems to exploit their collaborative strengths. Evaluations are performed on the manually annotated patient discharge summaries from the Informatics for Integrating Biology and the Bedside group (I2B2) and the CRAFT dataset. The main results include (1) BANNER significantly outperforms the other three systems on the I2B2 dataset with F1 values in the range of .73-.89, in contrast to .28 - .60 of other systems; and (2) Surprisingly, an ensemble approach of BANNER with any combinations of the other three approaches tends to degrade the performance by .08 - .11 in F1 when evaluated on the I2B2 dataset. Based on the evaluation results, we have developed a BANNER-based NER module for OpenMRS to recognize semantic concepts including problems, tests, and treatments. This module works with OpenMRS versions 2.×. The user interface presents concise clinical notes summaries and allows the user to filter, search and view the context of the concepts. We have also developed a companion web application to retrain the BANNER model using data from OpenMRS. The module and source code are available at wiki.openmrs.org/display/docs/Visit+Notes+Analysis+Module.
利用医学概念提取在OpenMRS中构建非结构化临床叙述
我们为开源电子病历系统OpenMRS开发了一个新的软件模块来分析非结构化的临床叙述。该模块利用命名实体识别(NER)提供简明的、语义类型驱动的、交互式的临床笔记摘要。为此,我们使用临床文本叙述和完整的生物医学期刊文章对四种命名实体识别(NER)系统进行了广泛的实证评估。正在评估的四个NER系统是MetaMap, cTAKES, BANNER。我们研究了基于上述四个NER系统的几种集成方法,以利用它们的协作优势。对来自整合生物学信息学和床边组(I2B2)和CRAFT数据集的人工注释的患者出院摘要进行评估。主要结果包括:(1)BANNER在I2B2数据集上显著优于其他三个系统,F1值在0.73 -范围内。89,而其他系统为0.28 - 0.60;(2)令人惊讶的是,当在I2B2数据集上进行评估时,BANNER的集成方法与其他三种方法的任何组合往往会使F1的性能降低0.08 - 0.11。根据评估结果,我们为OpenMRS开发了一个基于banner的NER模块,用于识别包括问题、测试和处理在内的语义概念。该模块适用于OpenMRS版本2. x。用户界面呈现简洁的临床笔记摘要,并允许用户过滤,搜索和查看概念的上下文。我们还开发了一个配套的web应用程序,使用OpenMRS的数据重新训练BANNER模型。该模块和源代码可在wiki.openmrs.org/display/docs/Visit+Notes+Analysis+Module上获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信