{"title":"A medical information extraction model with contrastive tuning and tagging layer training","authors":"Xiaowei Wang","doi":"10.1016/j.compbiomed.2025.110465","DOIUrl":null,"url":null,"abstract":"<div><div>Medical information extraction, as a core task in medical intelligent systems, focuses on extracting necessary structured information from clinical texts. In recent years, deep learning-based methods have become mainstream and often achieve superior extraction results. However, these existing methods have not fully tapped into the semantic potential of medical information categories, and most rely on a large amount of annotated data. This study proposes a novel semantic guided representation training model for medical information, which trains the representation of medical texts and medical information categories in the same semantic space by contrasting loss mechanisms, effectively reducing the need for annotated data. The experimental results show that our method objectives F1 value of 88.29 on CCKS2019 and 90.68 on CMeEE. Our method also exceeds the baseline by 4.07 on CCKS2019 and 4.95 on CMeEE.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"193 ","pages":"Article 110465"},"PeriodicalIF":7.0000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525008169","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Medical information extraction, as a core task in medical intelligent systems, focuses on extracting necessary structured information from clinical texts. In recent years, deep learning-based methods have become mainstream and often achieve superior extraction results. However, these existing methods have not fully tapped into the semantic potential of medical information categories, and most rely on a large amount of annotated data. This study proposes a novel semantic guided representation training model for medical information, which trains the representation of medical texts and medical information categories in the same semantic space by contrasting loss mechanisms, effectively reducing the need for annotated data. The experimental results show that our method objectives F1 value of 88.29 on CCKS2019 and 90.68 on CMeEE. Our method also exceeds the baseline by 4.07 on CCKS2019 and 4.95 on CMeEE.
期刊介绍:
Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.