用大语言模型从临床叙述中识别阿片类药物过量和阿片类药物使用障碍及其相关信息

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science Pub Date : 2025-06-10 eCollection Date: 2025-01-01

Daniel Paredes, Sankalp Talankar, Cheng Peng, Patrick Balian, Motomoti Lewis, Shunhun Yan, Wen-Shan Tsai PharmD, Ching-Yuan Chang, Debbie L Wilson, Wei-Hsuan Lo-Ciganic, Yonghui Wu

{"title":"用大语言模型从临床叙述中识别阿片类药物过量和阿片类药物使用障碍及其相关信息","authors":"Daniel Paredes, Sankalp Talankar, Cheng Peng, Patrick Balian, Motomoti Lewis, Shunhun Yan, Wen-Shan Tsai PharmD, Ching-Yuan Chang, Debbie L Wilson, Wei-Hsuan Lo-Ciganic, Yonghui Wu","doi":"","DOIUrl":null,"url":null,"abstract":"Opioid overdose and opioid use disorder (OUD) remain a growing public health issue in the United States, affecting 6.1 million individuals in 2022, more than doubling the 2.5 million from 2021. Accurately identifying the opioid overdose and OUD related information is critical to study the outcomes and develop interventions. This study aims to identify opioid overdose and OUD mentions and their related information from clinical narratives. We compared encoder-based large language models (LLMs) and decoder-based generative LLMs in extracting nine crucial concepts related with opioid overdose and OUD including problematic opioid use. Through a cost-effective p-tuning algorithm, our decoder-based generative LLM, GatorTronGPT, achieved the best strict/lenient F1-score of 0.8637, and 0.9057, demonstrating the efficient of using generative LLMs for opioid overdose/OUD related information extraction. This study provided a tool to systematically extract opioid overdose, OUD, and their related information to facilitate opioid-related studies using clinical narratives.","PeriodicalId":72181,"journal":{"name":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","volume":"2025 ","pages":"414-421"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150707/pdf/","citationCount":"0","resultStr":"{\"title\":\"Identifying Opioid Overdose and Opioid Use Disorder and Related Information from Clinical Narratives Using Large Language Models.\",\"authors\":\"Daniel Paredes, Sankalp Talankar, Cheng Peng, Patrick Balian, Motomoti Lewis, Shunhun Yan, Wen-Shan Tsai PharmD, Ching-Yuan Chang, Debbie L Wilson, Wei-Hsuan Lo-Ciganic, Yonghui Wu\",\"doi\":\"\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Opioid overdose and opioid use disorder (OUD) remain a growing public health issue in the United States, affecting 6.1 million individuals in 2022, more than doubling the 2.5 million from 2021. Accurately identifying the opioid overdose and OUD related information is critical to study the outcomes and develop interventions. This study aims to identify opioid overdose and OUD mentions and their related information from clinical narratives. We compared encoder-based large language models (LLMs) and decoder-based generative LLMs in extracting nine crucial concepts related with opioid overdose and OUD including problematic opioid use. Through a cost-effective p-tuning algorithm, our decoder-based generative LLM, GatorTronGPT, achieved the best strict/lenient F1-score of 0.8637, and 0.9057, demonstrating the efficient of using generative LLMs for opioid overdose/OUD related information extraction. This study provided a tool to systematically extract opioid overdose, OUD, and their related information to facilitate opioid-related studies using clinical narratives.\",\"PeriodicalId\":72181,\"journal\":{\"name\":\"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science\",\"volume\":\"2025 \",\"pages\":\"414-421\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12150707/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science","FirstCategoryId":"1085","ListUrlMain":"","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

阿片类药物过量和阿片类药物使用障碍（OUD）在美国仍然是一个日益严重的公共卫生问题，2022年影响610万人，比2021年的250万人增加了一倍多。准确识别阿片类药物过量和OUD相关信息对于研究结果和制定干预措施至关重要。本研究旨在从临床叙述中识别阿片类药物过量和OUD提及及其相关信息。我们比较了基于编码器的大型语言模型（llm）和基于解码器的生成式llm，以提取与阿片类药物过量和OUD相关的九个关键概念，包括阿片类药物的问题使用。通过高性价比的p-tuning算法，我们基于解码器的生成式LLM GatorTronGPT获得了最佳的严格/宽松f1分数0.8637和0.9057，证明了使用生成式LLM进行阿片类药物过量/OUD相关信息提取的有效性。本研究为系统提取阿片类药物过量、OUD及其相关信息提供了工具，为使用临床叙述进行阿片类药物相关研究提供了便利。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

本刊更多论文

Identifying Opioid Overdose and Opioid Use Disorder and Related Information from Clinical Narratives Using Large Language Models.

Opioid overdose and opioid use disorder (OUD) remain a growing public health issue in the United States, affecting 6.1 million individuals in 2022, more than doubling the 2.5 million from 2021. Accurately identifying the opioid overdose and OUD related information is critical to study the outcomes and develop interventions. This study aims to identify opioid overdose and OUD mentions and their related information from clinical narratives. We compared encoder-based large language models (LLMs) and decoder-based generative LLMs in extracting nine crucial concepts related with opioid overdose and OUD including problematic opioid use. Through a cost-effective p-tuning algorithm, our decoder-based generative LLM, GatorTronGPT, achieved the best strict/lenient F1-score of 0.8637, and 0.9057, demonstrating the efficient of using generative LLMs for opioid overdose/OUD related information extraction. This study provided a tool to systematically extract opioid overdose, OUD, and their related information to facilitate opioid-related studies using clinical narratives.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science

自引率

0.00%

发文量