Cross Documents Concept Augmentation

M. Vasiu, L. Marghescu, Ioana Barbantan, R. Potolea
{"title":"Cross Documents Concept Augmentation","authors":"M. Vasiu, L. Marghescu, Ioana Barbantan, R. Potolea","doi":"10.1109/ICCP.2018.8516582","DOIUrl":null,"url":null,"abstract":"The current paper proposes a strategy for exploring and integrating related information extracted from unstructured documents with different degree of confidence, standardization and representation. The strategy was instantiated on the medical domain and designed for the English language. The goal of the proposed strategy was of augmenting the therapeutic information from patient leaflets with information extracted from clinical records. The approach proved to be a sound one as the information from the clinical records aligns with the information in the standardized sources. It confirmed the assumption that we can derive drug repositioning from clinical records and thus augmenting the existing medical knowledge. The reported metrics <95.14% precision, 83.3% recall>for patient leaflets and <94.07% precision, 87.27% recall >for EHRs measured for the concept extraction strategy, further support a good performance for the entities correlation approach. The degree of correlation between the extracted information from the two data sources reported as matches is of 85%.","PeriodicalId":259007,"journal":{"name":"2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP)","volume":"71 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on Intelligent Computer Communication and Processing (ICCP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCP.2018.8516582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The current paper proposes a strategy for exploring and integrating related information extracted from unstructured documents with different degree of confidence, standardization and representation. The strategy was instantiated on the medical domain and designed for the English language. The goal of the proposed strategy was of augmenting the therapeutic information from patient leaflets with information extracted from clinical records. The approach proved to be a sound one as the information from the clinical records aligns with the information in the standardized sources. It confirmed the assumption that we can derive drug repositioning from clinical records and thus augmenting the existing medical knowledge. The reported metrics <95.14% precision, 83.3% recall>for patient leaflets and <94.07% precision, 87.27% recall >for EHRs measured for the concept extraction strategy, further support a good performance for the entities correlation approach. The degree of correlation between the extracted information from the two data sources reported as matches is of 85%.
跨文档概念增强
本文提出了一种从不同置信度、标准化和代表性的非结构化文档中提取相关信息的挖掘和集成策略。该策略在医学领域实例化,并为英语语言设计。所提出的策略的目标是从临床记录中提取的信息来增强患者小叶中的治疗信息。由于临床记录的信息与标准化来源的信息一致,该方法被证明是一种合理的方法。它证实了我们可以从临床记录中得出药物重新定位的假设,从而增加了现有的医学知识。为概念提取策略测量的患者宣传单和电子病历的报告指标进一步支持实体关联方法的良好性能。从报告为匹配的两个数据源中提取的信息之间的相关性为85%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信