利用大型语言模型从处方药标签中提取药物安全信息。

IF 3.8 2区医学 Q1 PHARMACOLOGY & PHARMACY

Drug Safety Pub Date : 2025-09-02 DOI:10.1007/s40264-025-01594-x

Undina Gisladottir, Michael Zietz, Sophia Kivelson, Yutaro Tanaka, Gaurav Sirdeshmukh, Kathleen LaRow Brown, Nicholas P Tatonetti

{"title":"利用大型语言模型从处方药标签中提取药物安全信息。","authors":"Undina Gisladottir, Michael Zietz, Sophia Kivelson, Yutaro Tanaka, Gaurav Sirdeshmukh, Kathleen LaRow Brown, Nicholas P Tatonetti","doi":"10.1007/s40264-025-01594-x","DOIUrl":null,"url":null,"abstract":"Introduction: Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.Objective: In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.Methods: We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.Results: We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.Conclusion: Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.","PeriodicalId":11382,"journal":{"name":"Drug Safety","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging Large Language Models in Extracting Drug Safety Information from Prescription Drug Labels.\",\"authors\":\"Undina Gisladottir, Michael Zietz, Sophia Kivelson, Yutaro Tanaka, Gaurav Sirdeshmukh, Kathleen LaRow Brown, Nicholas P Tatonetti\",\"doi\":\"10.1007/s40264-025-01594-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.Objective: In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.Methods: We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.Results: We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.Conclusion: Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.\",\"PeriodicalId\":11382,\"journal\":{\"name\":\"Drug Safety\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Drug Safety\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s40264-025-01594-x\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug Safety","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40264-025-01594-x","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}

引用次数: 0

摘要

药物不良反应（adr），包括由药物相互作用引起的不良反应，仍然是发病率和死亡率的主要原因。结构化产品标签（SPLs）是药品安全信息的主要来源。拥有机器可读的产品标签，包括不良反应（ARs）和药物相互作用，将使研究人员能够简化药物安全性研究。然而，提取这些信息是复杂的，需要使用自然语言处理（NLP）方法。目的：探讨生成语言模型在药物安全信息提取中的应用。方法：我们比较了多种生成LLMs （GPT， Llama和Mixtral）与两种基线方法在从SPLs中提取不良反应（ARs）的任务中。我们探索了影响这些模型在ar提取中的性能的各种因素，如提示策略和术语复杂性。最后，我们探索了生成模型在没有额外微调或训练的情况下从单个SPLs部分提取药物相互作用的能力，展示了它们在信息检索方面的灵活性和适应性。结果：我们发现生成语言模型，特别是GPT-4，能够匹配或超过以前最先进的模型的性能，而无需额外的训练或微调。此外，我们发现特定的SPL部分、周围环境和AR术语的复杂性会影响提取性能。最后，我们通过将这些模型应用于从药物相互作用部分提取药物名称的单独任务，证明了这些模型的泛化性。结论：生成语言模型在从药品安全清单中自动提取药品安全信息方面显示出巨大的潜力，为改善上市后监管和减少adr提供了一条有希望的途径。未来的工作应该集中在完善提示策略和扩展模型的能力，以处理日益复杂和微妙的药物安全信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Leveraging Large Language Models in Extracting Drug Safety Information from Prescription Drug Labels.

Introduction: Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.

Objective: In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.

Methods: We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.

Results: We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.

Conclusion: Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Drug Safety 医学-毒理学

CiteScore

7.60

自引率

7.10%

发文量

112

审稿时长

6-12 weeks

期刊介绍： Drug Safety is the official journal of the International Society of Pharmacovigilance. The journal includes: Overviews of contentious or emerging issues. Comprehensive narrative reviews that provide an authoritative source of information on epidemiology, clinical features, prevention and management of adverse effects of individual drugs and drug classes. In-depth benefit-risk assessment of adverse effect and efficacy data for a drug in a defined therapeutic area. Systematic reviews (with or without meta-analyses) that collate empirical evidence to answer a specific research question, using explicit, systematic methods as outlined by the PRISMA statement. Original research articles reporting the results of well-designed studies in disciplines such as pharmacoepidemiology, pharmacovigilance, pharmacology and toxicology, and pharmacogenomics. Editorials and commentaries on topical issues. Additional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal’s content. In addition, articles published in Drug Safety Drugs may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.