{"title":"基于CRF-MT-Adapt和NER-MRC的中医命名实体识别","authors":"Hengyi Zheng, Bin Qin, Ming Xu","doi":"10.1109/CDS52072.2021.00068","DOIUrl":null,"url":null,"abstract":"Medical Named Entity Recognition is a fundamental component of understanding the medical free-text notes in Electronic Health Records, and it has become a popular research topic in both academia and industry. The China Conference on Knowledge Graph and Semantic Computing (CCKS) organizes a challenge for Medical Named Entity Recognition, aiming at extracting medical entity mentions and categorizing them into pre-defined classes. We propose a Multi-Task sequence labeling model with Adaptive Loss Weighting (CRF-MT-Adapt) to address the issue of low recall and a Named Entity Recognition model based on Machine Reading Comprehension (NER-MRC) to address the issue of long-span entity mentions. We experimentally demonstrate the state-of-the-art performance of the two proposed models and the ensemble even surpasses the strong baselines by at least 2% F-score. On the official test set, our best submission achieves an F-score of 90.51% and 95.96% under strict and relaxed criteria respectively.","PeriodicalId":380426,"journal":{"name":"2021 2nd International Conference on Computing and Data Science (CDS)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Chinese Medical Named Entity Recognition using CRF-MT-Adapt and NER-MRC\",\"authors\":\"Hengyi Zheng, Bin Qin, Ming Xu\",\"doi\":\"10.1109/CDS52072.2021.00068\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Medical Named Entity Recognition is a fundamental component of understanding the medical free-text notes in Electronic Health Records, and it has become a popular research topic in both academia and industry. The China Conference on Knowledge Graph and Semantic Computing (CCKS) organizes a challenge for Medical Named Entity Recognition, aiming at extracting medical entity mentions and categorizing them into pre-defined classes. We propose a Multi-Task sequence labeling model with Adaptive Loss Weighting (CRF-MT-Adapt) to address the issue of low recall and a Named Entity Recognition model based on Machine Reading Comprehension (NER-MRC) to address the issue of long-span entity mentions. We experimentally demonstrate the state-of-the-art performance of the two proposed models and the ensemble even surpasses the strong baselines by at least 2% F-score. On the official test set, our best submission achieves an F-score of 90.51% and 95.96% under strict and relaxed criteria respectively.\",\"PeriodicalId\":380426,\"journal\":{\"name\":\"2021 2nd International Conference on Computing and Data Science (CDS)\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 2nd International Conference on Computing and Data Science (CDS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CDS52072.2021.00068\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 2nd International Conference on Computing and Data Science (CDS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CDS52072.2021.00068","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Chinese Medical Named Entity Recognition using CRF-MT-Adapt and NER-MRC
Medical Named Entity Recognition is a fundamental component of understanding the medical free-text notes in Electronic Health Records, and it has become a popular research topic in both academia and industry. The China Conference on Knowledge Graph and Semantic Computing (CCKS) organizes a challenge for Medical Named Entity Recognition, aiming at extracting medical entity mentions and categorizing them into pre-defined classes. We propose a Multi-Task sequence labeling model with Adaptive Loss Weighting (CRF-MT-Adapt) to address the issue of low recall and a Named Entity Recognition model based on Machine Reading Comprehension (NER-MRC) to address the issue of long-span entity mentions. We experimentally demonstrate the state-of-the-art performance of the two proposed models and the ensemble even surpasses the strong baselines by at least 2% F-score. On the official test set, our best submission achieves an F-score of 90.51% and 95.96% under strict and relaxed criteria respectively.