A medical information extraction model with contrastive tuning and tagging layer training

IF 7 2区医学 Q1 BIOLOGY

Computers in biology and medicine Pub Date : 2025-05-29 DOI:10.1016/j.compbiomed.2025.110465

Xiaowei Wang

引用次数: 0

Abstract

Medical information extraction, as a core task in medical intelligent systems, focuses on extracting necessary structured information from clinical texts. In recent years, deep learning-based methods have become mainstream and often achieve superior extraction results. However, these existing methods have not fully tapped into the semantic potential of medical information categories, and most rely on a large amount of annotated data. This study proposes a novel semantic guided representation training model for medical information, which trains the representation of medical texts and medical information categories in the same semantic space by contrasting loss mechanisms, effectively reducing the need for annotated data. The experimental results show that our method objectives F1 value of 88.29 on CCKS2019 and 90.68 on CMeEE. Our method also exceeds the baseline by 4.07 on CCKS2019 and 4.95 on CMeEE.

查看原文本刊更多论文

一种带有对比调优和标记层训练的医学信息提取模型

医学信息提取是医疗智能系统的核心任务，主要是从临床文本中提取必要的结构化信息。近年来，基于深度学习的方法已经成为主流，并且往往能够取得优异的提取效果。然而，这些现有的方法并没有充分挖掘医学信息类别的语义潜力，而且大多依赖于大量的注释数据。本研究提出了一种新的医学信息语义引导表示训练模型，通过对比损失机制来训练医学文本和医学信息类别在同一语义空间中的表示，有效减少了对标注数据的需求。实验结果表明，我们的方法在CCKS2019上的F1值为88.29，在CMeEE上的F1值为90.68。我们的方法在CCKS2019上也超过基线4.07，在CMeEE上也超过基线4.95。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers in biology and medicine 工程技术-工程：生物医学

CiteScore

11.70

自引率

10.40%

发文量

1086

审稿时长

74 days

期刊介绍： Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.