Medical entity recognition of Esophageal Carcinoma based on word clustering

Rui Wang, Jinfeng Zhao, Lizhi Peng, Bo Yang, Lin Wang, Baosheng Li
{"title":"Medical entity recognition of Esophageal Carcinoma based on word clustering","authors":"Rui Wang, Jinfeng Zhao, Lizhi Peng, Bo Yang, Lin Wang, Baosheng Li","doi":"10.1109/SPAC46244.2018.8965515","DOIUrl":null,"url":null,"abstract":"Electronic Medical Records (EMRs) is the core of medical information system in hospital. EMRs arises from the medical institution, and large amount of clinical data generated every day. Due to the EMRs contains many medical entities and clinical information of the patient, by analyzing and mining the texts data, medical knowledge which closely related to patients or certain diseases can be obtained. In this paper, we ues the EMRs of patients with Esophageal Carcinoma(EC), which include the clinical symptoms of the patients, tests they have undergone, results of the examinations, and the diagnosis and treatment plan. In this paper, word vector training is carried out for large-scale electronic medical record corpus by means of the skip model of word2vec deep learning tool. Then applying the features data to the k-means clustering algorithm to identifying medical entities about EC by word clustering. As the first step of medical knowledge mining, medical entity recognition is the premise of extracting the semantic relationship implied in medical text and structuring EMRs.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965515","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Electronic Medical Records (EMRs) is the core of medical information system in hospital. EMRs arises from the medical institution, and large amount of clinical data generated every day. Due to the EMRs contains many medical entities and clinical information of the patient, by analyzing and mining the texts data, medical knowledge which closely related to patients or certain diseases can be obtained. In this paper, we ues the EMRs of patients with Esophageal Carcinoma(EC), which include the clinical symptoms of the patients, tests they have undergone, results of the examinations, and the diagnosis and treatment plan. In this paper, word vector training is carried out for large-scale electronic medical record corpus by means of the skip model of word2vec deep learning tool. Then applying the features data to the k-means clustering algorithm to identifying medical entities about EC by word clustering. As the first step of medical knowledge mining, medical entity recognition is the premise of extracting the semantic relationship implied in medical text and structuring EMRs.
基于词聚类的食管癌医学实体识别
电子病案是医院医疗信息系统的核心。电子病历产生于医疗机构,每天产生大量的临床数据。由于电子病历中包含大量的医疗实体和患者的临床信息,通过对文本数据的分析和挖掘,可以获得与患者或特定疾病密切相关的医学知识。本文采用食管癌患者的电子病历,包括患者的临床症状、所做的检查、检查结果以及诊断和治疗方案。本文利用word2vec深度学习工具的跳跃模型,对大型电子病历语料库进行词向量训练。然后将特征数据应用到k-means聚类算法中,通过词聚类对EC的医疗实体进行识别。医学实体识别是医学知识挖掘的第一步,是提取医学文本隐含的语义关系和构建电子病历的前提。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信