{"title":"中国电子病历细粒度生物医学关系提取的远程监控","authors":"Qing Zhao, Zhilong Ma, Jianqiang Li","doi":"10.1109/ICNSC55942.2022.10004079","DOIUrl":null,"url":null,"abstract":"Automatically extract relations between medical entity pairs is fundamental in biomedical research. Since the annotated dataset is very expensive, distant supervision provides an efficient solution to reduce the cost of annotation by utilizing rough corpus labeled with semantic knowledge base. However, two same entities mentioned in different sentences may express different relations, it is difficult for the traditional distant supervision methods to distinguish these different relations. In this paper, we propose a new model for biomedical relation extraction in Chinese EMRs. First, the distant supervision is used for coarse-grained relation labeling. Then, the fine-grained relations are annotated initially by measuring the distance between the contextual information of the relation instance to the semantic profile of each candidate fine-grained relation category. Finally, the high confidence fine-grained relation instances are selected as initial training set for PCNN model, in addition, a bootstrap learning is introduced in the training process to enhance the performance of fine-grained relation extraction. Experiments conducted on a real-word dataset and the results show that our method outperforms all baseline systems.","PeriodicalId":230499,"journal":{"name":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","volume":"80 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Distant supervision for fine-grained biomedical relation extraction from Chinese EMRs\",\"authors\":\"Qing Zhao, Zhilong Ma, Jianqiang Li\",\"doi\":\"10.1109/ICNSC55942.2022.10004079\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Automatically extract relations between medical entity pairs is fundamental in biomedical research. Since the annotated dataset is very expensive, distant supervision provides an efficient solution to reduce the cost of annotation by utilizing rough corpus labeled with semantic knowledge base. However, two same entities mentioned in different sentences may express different relations, it is difficult for the traditional distant supervision methods to distinguish these different relations. In this paper, we propose a new model for biomedical relation extraction in Chinese EMRs. First, the distant supervision is used for coarse-grained relation labeling. Then, the fine-grained relations are annotated initially by measuring the distance between the contextual information of the relation instance to the semantic profile of each candidate fine-grained relation category. Finally, the high confidence fine-grained relation instances are selected as initial training set for PCNN model, in addition, a bootstrap learning is introduced in the training process to enhance the performance of fine-grained relation extraction. Experiments conducted on a real-word dataset and the results show that our method outperforms all baseline systems.\",\"PeriodicalId\":230499,\"journal\":{\"name\":\"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)\",\"volume\":\"80 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-12-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICNSC55942.2022.10004079\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Networking, Sensing and Control (ICNSC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICNSC55942.2022.10004079","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Distant supervision for fine-grained biomedical relation extraction from Chinese EMRs
Automatically extract relations between medical entity pairs is fundamental in biomedical research. Since the annotated dataset is very expensive, distant supervision provides an efficient solution to reduce the cost of annotation by utilizing rough corpus labeled with semantic knowledge base. However, two same entities mentioned in different sentences may express different relations, it is difficult for the traditional distant supervision methods to distinguish these different relations. In this paper, we propose a new model for biomedical relation extraction in Chinese EMRs. First, the distant supervision is used for coarse-grained relation labeling. Then, the fine-grained relations are annotated initially by measuring the distance between the contextual information of the relation instance to the semantic profile of each candidate fine-grained relation category. Finally, the high confidence fine-grained relation instances are selected as initial training set for PCNN model, in addition, a bootstrap learning is introduced in the training process to enhance the performance of fine-grained relation extraction. Experiments conducted on a real-word dataset and the results show that our method outperforms all baseline systems.