Rui Wang, Jinfeng Zhao, Lizhi Peng, Bo Yang, Lin Wang, Baosheng Li
{"title":"Medical entity recognition of Esophageal Carcinoma based on word clustering","authors":"Rui Wang, Jinfeng Zhao, Lizhi Peng, Bo Yang, Lin Wang, Baosheng Li","doi":"10.1109/SPAC46244.2018.8965515","DOIUrl":null,"url":null,"abstract":"Electronic Medical Records (EMRs) is the core of medical information system in hospital. EMRs arises from the medical institution, and large amount of clinical data generated every day. Due to the EMRs contains many medical entities and clinical information of the patient, by analyzing and mining the texts data, medical knowledge which closely related to patients or certain diseases can be obtained. In this paper, we ues the EMRs of patients with Esophageal Carcinoma(EC), which include the clinical symptoms of the patients, tests they have undergone, results of the examinations, and the diagnosis and treatment plan. In this paper, word vector training is carried out for large-scale electronic medical record corpus by means of the skip model of word2vec deep learning tool. Then applying the features data to the k-means clustering algorithm to identifying medical entities about EC by word clustering. As the first step of medical knowledge mining, medical entity recognition is the premise of extracting the semantic relationship implied in medical text and structuring EMRs.","PeriodicalId":360369,"journal":{"name":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","volume":"10 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 International Conference on Security, Pattern Analysis, and Cybernetics (SPAC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SPAC46244.2018.8965515","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
Electronic Medical Records (EMRs) is the core of medical information system in hospital. EMRs arises from the medical institution, and large amount of clinical data generated every day. Due to the EMRs contains many medical entities and clinical information of the patient, by analyzing and mining the texts data, medical knowledge which closely related to patients or certain diseases can be obtained. In this paper, we ues the EMRs of patients with Esophageal Carcinoma(EC), which include the clinical symptoms of the patients, tests they have undergone, results of the examinations, and the diagnosis and treatment plan. In this paper, word vector training is carried out for large-scale electronic medical record corpus by means of the skip model of word2vec deep learning tool. Then applying the features data to the k-means clustering algorithm to identifying medical entities about EC by word clustering. As the first step of medical knowledge mining, medical entity recognition is the premise of extracting the semantic relationship implied in medical text and structuring EMRs.