FedCE：一种用于异构医学命名实体识别的对比增强联邦学习方法

IF 3.5 1区计算机科学 Q1 Multidisciplinary

Tsinghua Science and Technology Pub Date : 2025-07-04 DOI:10.26599/TST.2024.9010186

Kai Chang;Hailong Sun;Jindou Wan;Naiqian Zhang;Yiming Liu;Kuo Yang;Zixin Shu;Jianan Xia;Xuezhong Zhou

{"title":"FedCE：一种用于异构医学命名实体识别的对比增强联邦学习方法","authors":"Kai Chang;Hailong Sun;Jindou Wan;Naiqian Zhang;Yiming Liu;Kuo Yang;Zixin Shu;Jianan Xia;Xuezhong Zhou","doi":"10.26599/TST.2024.9010186","DOIUrl":null,"url":null,"abstract":"Medical Named Entity Recognition (NER) plays a crucial role in attaining precise patient portraits as well as providing support for intelligent diagnosis and treatment decisions. Federated Learning (FL) enables collaborative modeling and training across multiple endpoints without exposing the original data. However, the statistical heterogeneity exhibited by clinical medical text records poses a challenge for FL methods to support the training of NER models in such scenarios. We propose a Federated Contrast Enhancement (FedCE) method for NER to address the challenges faced by non-large-scale pre-trained models in FL for label-heterogeneous. The method leverages a multi-view encoder structure to capture both global and local semantic information, and employs contrastive learning to enhance the interoperability of global knowledge and local context. We evaluate the performance of the FedCE method on three real-world clinical record datasets. We investigate the impact of factors, such as pooling methods, maximum input text length, and training rounds on FedCE. Additionally, we assess how well FedCE adapts to the base NER models and evaluate its generalization performance. The experimental results show that the FedCE method has obvious advantages and can be effectively applied to various basic models, which is of great theoretical and practical significance for advancing FL in healthcare settings.","PeriodicalId":48690,"journal":{"name":"Tsinghua Science and Technology","volume":"30 6","pages":"2384-2398"},"PeriodicalIF":3.5000,"publicationDate":"2025-07-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11072110","citationCount":"0","resultStr":"{\"title\":\"FedCE: A Contrast Enhancement Federated Learning Method for Heterogeneous Medical Named Entity Recognition\",\"authors\":\"Kai Chang;Hailong Sun;Jindou Wan;Naiqian Zhang;Yiming Liu;Kuo Yang;Zixin Shu;Jianan Xia;Xuezhong Zhou\",\"doi\":\"10.26599/TST.2024.9010186\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Medical Named Entity Recognition (NER) plays a crucial role in attaining precise patient portraits as well as providing support for intelligent diagnosis and treatment decisions. Federated Learning (FL) enables collaborative modeling and training across multiple endpoints without exposing the original data. However, the statistical heterogeneity exhibited by clinical medical text records poses a challenge for FL methods to support the training of NER models in such scenarios. We propose a Federated Contrast Enhancement (FedCE) method for NER to address the challenges faced by non-large-scale pre-trained models in FL for label-heterogeneous. The method leverages a multi-view encoder structure to capture both global and local semantic information, and employs contrastive learning to enhance the interoperability of global knowledge and local context. We evaluate the performance of the FedCE method on three real-world clinical record datasets. We investigate the impact of factors, such as pooling methods, maximum input text length, and training rounds on FedCE. Additionally, we assess how well FedCE adapts to the base NER models and evaluate its generalization performance. The experimental results show that the FedCE method has obvious advantages and can be effectively applied to various basic models, which is of great theoretical and practical significance for advancing FL in healthcare settings.\",\"PeriodicalId\":48690,\"journal\":{\"name\":\"Tsinghua Science and Technology\",\"volume\":\"30 6\",\"pages\":\"2384-2398\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-07-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=11072110\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Tsinghua Science and Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11072110/\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"Multidisciplinary\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Tsinghua Science and Technology","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/11072110/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Multidisciplinary","Score":null,"Total":0}

引用次数: 0

摘要

医学命名实体识别（NER）在获得精确的患者画像以及为智能诊断和治疗决策提供支持方面发挥着至关重要的作用。联邦学习（FL）支持跨多个端点的协作建模和训练，而无需暴露原始数据。然而，临床医学文本记录显示的统计异质性对FL方法在这种情况下支持NER模型的训练提出了挑战。我们提出了一种用于NER的联邦对比度增强（FedCE）方法，以解决非大规模预训练模型在标签异构的FL中面临的挑战。该方法利用多视图编码器结构捕获全局和局部语义信息，并利用对比学习增强全局知识和局部上下文的互操作性。我们在三个真实的临床记录数据集上评估了FedCE方法的性能。我们研究了诸如池化方法、最大输入文本长度和FedCE训练回合等因素的影响。此外，我们评估了FedCE对基本NER模型的适应程度，并评估了其泛化性能。实验结果表明，FedCE方法具有明显的优势，可以有效地应用于各种基础模型，这对于推进FL在医疗环境中的应用具有重要的理论和实践意义。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

FedCE: A Contrast Enhancement Federated Learning Method for Heterogeneous Medical Named Entity Recognition

Medical Named Entity Recognition (NER) plays a crucial role in attaining precise patient portraits as well as providing support for intelligent diagnosis and treatment decisions. Federated Learning (FL) enables collaborative modeling and training across multiple endpoints without exposing the original data. However, the statistical heterogeneity exhibited by clinical medical text records poses a challenge for FL methods to support the training of NER models in such scenarios. We propose a Federated Contrast Enhancement (FedCE) method for NER to address the challenges faced by non-large-scale pre-trained models in FL for label-heterogeneous. The method leverages a multi-view encoder structure to capture both global and local semantic information, and employs contrastive learning to enhance the interoperability of global knowledge and local context. We evaluate the performance of the FedCE method on three real-world clinical record datasets. We investigate the impact of factors, such as pooling methods, maximum input text length, and training rounds on FedCE. Additionally, we assess how well FedCE adapts to the base NER models and evaluate its generalization performance. The experimental results show that the FedCE method has obvious advantages and can be effectively applied to various basic models, which is of great theoretical and practical significance for advancing FL in healthcare settings.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Tsinghua Science and Technology COMPUTER SCIENCE, INFORMATION SYSTEMSCOMPU-COMPUTER SCIENCE, SOFTWARE ENGINEERING

CiteScore

10.20

自引率

10.60%

发文量

2340

期刊介绍： Tsinghua Science and Technology (Tsinghua Sci Technol) started publication in 1996. It is an international academic journal sponsored by Tsinghua University and is published bimonthly. This journal aims at presenting the up-to-date scientific achievements in computer science, electronic engineering, and other IT fields. Contributions all over the world are welcome.