CACER: Clinical concept Annotations for Cancer Events and Relations.

IF 4.7 2区医学 Q1 COMPUTER SCIENCE, INFORMATION SYSTEMS

Journal of the American Medical Informatics Association Pub Date : 2024-09-03 DOI:10.1093/jamia/ocae231

Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Ahmad Halwani, Bridget T McInnes, Fei Xia, Kevin Lybarger, Meliha Yetisgen, Özlem Uzuner

{"title":"CACER: Clinical concept Annotations for Cancer Events and Relations.","authors":"Yujuan Velvin Fu, Giridhar Kaushik Ramachandran, Ahmad Halwani, Bridget T McInnes, Fei Xia, Kevin Lybarger, Meliha Yetisgen, Özlem Uzuner","doi":"10.1093/jamia/ocae231","DOIUrl":null,"url":null,"abstract":"Objective: Clinical notes contain unstructured representations of patient histories, including the relationships between medical problems and prescription drugs. To investigate the relationship between cancer drugs and their associated symptom burden, we extract structured, semantic representations of medical problem and drug information from the clinical narratives of oncology notes.Materials and methods: We present Clinical concept Annotations for Cancer Events and Relations (CACER), a novel corpus with fine-grained annotations for over 48 000 medical problems and drug events and 10 000 drug-problem and problem-problem relations. Leveraging CACER, we develop and evaluate transformer-based information extraction models such as Bidirectional Encoder Representations from Transformers (BERT), Fine-tuned Language Net Text-To-Text Transfer Transformer (Flan-T5), Large Language Model Meta AI (Llama3), and Generative Pre-trained Transformers-4 (GPT-4) using fine-tuning and in-context learning (ICL).Results: In event extraction, the fine-tuned BERT and Llama3 models achieved the highest performance at 88.2-88.0 F1, which is comparable to the inter-annotator agreement (IAA) of 88.4 F1. In relation extraction, the fine-tuned BERT, Flan-T5, and Llama3 achieved the highest performance at 61.8-65.3 F1. GPT-4 with ICL achieved the worst performance across both tasks.Discussion: The fine-tuned models significantly outperformed GPT-4 in ICL, highlighting the importance of annotated training data and model optimization. Furthermore, the BERT models performed similarly to Llama3. For our task, large language models offer no performance advantage over the smaller BERT models.Conclusions: We introduce CACER, a novel corpus with fine-grained annotations for medical problems, drugs, and their relationships in clinical narratives of oncology notes. State-of-the-art transformer models achieved performance comparable to IAA for several extraction tasks.","PeriodicalId":50016,"journal":{"name":"Journal of the American Medical Informatics Association","volume":null,"pages":null},"PeriodicalIF":4.7000,"publicationDate":"2024-09-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association","FirstCategoryId":"91","ListUrlMain":"https://doi.org/10.1093/jamia/ocae231","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: Clinical notes contain unstructured representations of patient histories, including the relationships between medical problems and prescription drugs. To investigate the relationship between cancer drugs and their associated symptom burden, we extract structured, semantic representations of medical problem and drug information from the clinical narratives of oncology notes.

Materials and methods: We present Clinical concept Annotations for Cancer Events and Relations (CACER), a novel corpus with fine-grained annotations for over 48 000 medical problems and drug events and 10 000 drug-problem and problem-problem relations. Leveraging CACER, we develop and evaluate transformer-based information extraction models such as Bidirectional Encoder Representations from Transformers (BERT), Fine-tuned Language Net Text-To-Text Transfer Transformer (Flan-T5), Large Language Model Meta AI (Llama3), and Generative Pre-trained Transformers-4 (GPT-4) using fine-tuning and in-context learning (ICL).

Results: In event extraction, the fine-tuned BERT and Llama3 models achieved the highest performance at 88.2-88.0 F1, which is comparable to the inter-annotator agreement (IAA) of 88.4 F1. In relation extraction, the fine-tuned BERT, Flan-T5, and Llama3 achieved the highest performance at 61.8-65.3 F1. GPT-4 with ICL achieved the worst performance across both tasks.

Discussion: The fine-tuned models significantly outperformed GPT-4 in ICL, highlighting the importance of annotated training data and model optimization. Furthermore, the BERT models performed similarly to Llama3. For our task, large language models offer no performance advantage over the smaller BERT models.

Conclusions: We introduce CACER, a novel corpus with fine-grained annotations for medical problems, drugs, and their relationships in clinical narratives of oncology notes. State-of-the-art transformer models achieved performance comparable to IAA for several extraction tasks.

查看原文本刊更多论文

CACER：癌症事件和关系的临床概念注释。

目的：临床笔记包含病人病史的非结构化表述，包括医疗问题和处方药之间的关系。为了研究癌症药物与其相关症状负担之间的关系，我们从肿瘤笔记的临床叙述中提取了医疗问题和药物信息的结构化语义表征：我们提出了癌症事件和关系的临床概念注释（CACER），这是一个新颖的语料库，包含 48000 多个医疗问题和药物事件的细粒度注释，以及 10000 个药物-问题和问题-问题关系的注释。利用 CACER，我们开发并评估了基于转换器的信息提取模型，如转换器双向编码器表示（BERT）、微调语言网文本到文本传输转换器（Flan-T5）、大型语言模型元人工智能（Llama3），以及使用微调和上下文学习（ICL）的生成预训练转换器-4（GPT-4）：在事件提取方面，经过微调的 BERT 和 Llama3 模型取得了 88.2-88.0 F1 的最高性能，与 88.4 F1 的标注者间一致性（IAA）相当。在关系提取方面，微调 BERT、Flan-T5 和 Llama3 的性能最高，达到 61.8-65.3 F1。带有 ICL 的 GPT-4 在这两项任务中表现最差：讨论：经过微调的模型在 ICL 中的表现明显优于 GPT-4，这凸显了注释训练数据和模型优化的重要性。此外，BERT 模型的表现与 Llama3 相似。就我们的任务而言，大型语言模型与小型 BERT 模型相比没有性能优势：我们介绍了 CACER，这是一个新颖的语料库，其中对肿瘤学笔记临床叙述中的医疗问题、药物及其关系进行了细粒度注释。在几项提取任务中，最先进的转换器模型取得了与 IAA 相当的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of the American Medical Informatics Association 医学-计算机：跨学科应用

CiteScore

14.50

自引率

7.80%

发文量

230

审稿时长

3-8 weeks

期刊介绍： JAMIA is AMIA''s premier peer-reviewed journal for biomedical and health informatics. Covering the full spectrum of activities in the field, JAMIA includes informatics articles in the areas of clinical care, clinical research, translational science, implementation science, imaging, education, consumer health, public health, and policy. JAMIA''s articles describe innovative informatics research and systems that help to advance biomedical science and to promote health. Case reports, perspectives and reviews also help readers stay connected with the most important informatics developments in implementation, policy and education.

文献相关原料

公司名称	产品信息	采购帮参考价格