基于迁移学习的电子病历命名实体识别

Kunli Zhang, Chenghao Zhang, Yajuan Ye, Hongying Zan, Xiaomei Liu
{"title":"基于迁移学习的电子病历命名实体识别","authors":"Kunli Zhang, Chenghao Zhang, Yajuan Ye, Hongying Zan, Xiaomei Liu","doi":"10.1145/3560071.3560086","DOIUrl":null,"url":null,"abstract":"Named entity recognition is the first step in clinical electronic medical record text mining, which is significant for clinical decision support and personalized medicine. However, the lack of annotated electronic medical record datasets limits the application of pre-trained language models and deep neural networks in this field. To alleviate the problem of data scarcity, we propose T-RoBERTa-BiLSTM-CRF, a transfer learning-based electronic medical record entity recognition model, which aggregates the characteristics of medical data from different sources and uses a small amount of electronic medical record data as target data for further training. Compared with existing models, our approach can model medical entities more effectively, and the extensive comparative experiments on the CCKS 2019 and DEMRC datasets show the effectiveness of our approach.","PeriodicalId":249276,"journal":{"name":"Proceedings of the 2022 International Conference on Intelligent Medicine and Health","volume":"92 26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Named Entity Recognition in Electronic Medical Records Based on Transfer Learning\",\"authors\":\"Kunli Zhang, Chenghao Zhang, Yajuan Ye, Hongying Zan, Xiaomei Liu\",\"doi\":\"10.1145/3560071.3560086\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Named entity recognition is the first step in clinical electronic medical record text mining, which is significant for clinical decision support and personalized medicine. However, the lack of annotated electronic medical record datasets limits the application of pre-trained language models and deep neural networks in this field. To alleviate the problem of data scarcity, we propose T-RoBERTa-BiLSTM-CRF, a transfer learning-based electronic medical record entity recognition model, which aggregates the characteristics of medical data from different sources and uses a small amount of electronic medical record data as target data for further training. Compared with existing models, our approach can model medical entities more effectively, and the extensive comparative experiments on the CCKS 2019 and DEMRC datasets show the effectiveness of our approach.\",\"PeriodicalId\":249276,\"journal\":{\"name\":\"Proceedings of the 2022 International Conference on Intelligent Medicine and Health\",\"volume\":\"92 26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 International Conference on Intelligent Medicine and Health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3560071.3560086\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 International Conference on Intelligent Medicine and Health","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3560071.3560086","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

命名实体识别是临床电子病历文本挖掘的第一步,对临床决策支持和个性化医疗具有重要意义。然而,缺乏带注释的电子病历数据集限制了预训练语言模型和深度神经网络在该领域的应用。为了缓解数据稀缺的问题,我们提出了一种基于迁移学习的电子病历实体识别模型T-RoBERTa-BiLSTM-CRF,该模型综合了不同来源医疗数据的特征,使用少量的电子病历数据作为目标数据进行进一步训练。与现有模型相比,我们的方法可以更有效地建模医疗实体,并且在CCKS 2019和DEMRC数据集上的大量对比实验表明了我们的方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Named Entity Recognition in Electronic Medical Records Based on Transfer Learning
Named entity recognition is the first step in clinical electronic medical record text mining, which is significant for clinical decision support and personalized medicine. However, the lack of annotated electronic medical record datasets limits the application of pre-trained language models and deep neural networks in this field. To alleviate the problem of data scarcity, we propose T-RoBERTa-BiLSTM-CRF, a transfer learning-based electronic medical record entity recognition model, which aggregates the characteristics of medical data from different sources and uses a small amount of electronic medical record data as target data for further training. Compared with existing models, our approach can model medical entities more effectively, and the extensive comparative experiments on the CCKS 2019 and DEMRC datasets show the effectiveness of our approach.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信