{"title":"Medical Entity Extraction from Health Insurance Documents","authors":"Tianling Pu, Qifan Zhang, Junjie Yao, Yingjie Zhang","doi":"10.1109/ICBK50248.2020.00085","DOIUrl":null,"url":null,"abstract":"The task of named entity recognition is to identify certain types of entities with special meanings from the text. It is a basic task in natural language processing and the foundation of higher-level tasks such as relation extraction, knowledge graph, and question answering system. The correctness of the entity recognition has a huge influence on the effectiveness of the upper layer application.This paper mainly studies the problem of Chinese named entity recognition in the medical field. By extracting the information about the disease in the insurance text and labeling the entity of disease, treatment, and symptom, the data set for entity recognition is established. On the basis of the BILSTM-CRF model, we use different methods to improve the recognition effectiveness of the model. By incorporating word boundary information and adding attention mechanism in the BiLSTM layer, the effectiveness of entity recognition is further improved.","PeriodicalId":432857,"journal":{"name":"2020 IEEE International Conference on Knowledge Graph (ICKG)","volume":"61 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 IEEE International Conference on Knowledge Graph (ICKG)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICBK50248.2020.00085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The task of named entity recognition is to identify certain types of entities with special meanings from the text. It is a basic task in natural language processing and the foundation of higher-level tasks such as relation extraction, knowledge graph, and question answering system. The correctness of the entity recognition has a huge influence on the effectiveness of the upper layer application.This paper mainly studies the problem of Chinese named entity recognition in the medical field. By extracting the information about the disease in the insurance text and labeling the entity of disease, treatment, and symptom, the data set for entity recognition is established. On the basis of the BILSTM-CRF model, we use different methods to improve the recognition effectiveness of the model. By incorporating word boundary information and adding attention mechanism in the BiLSTM layer, the effectiveness of entity recognition is further improved.