Undina Gisladottir, Michael Zietz, Sophia Kivelson, Yutaro Tanaka, Gaurav Sirdeshmukh, Kathleen LaRow Brown, Nicholas P Tatonetti
{"title":"利用大型语言模型从处方药标签中提取药物安全信息。","authors":"Undina Gisladottir, Michael Zietz, Sophia Kivelson, Yutaro Tanaka, Gaurav Sirdeshmukh, Kathleen LaRow Brown, Nicholas P Tatonetti","doi":"10.1007/s40264-025-01594-x","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.</p><p><strong>Objective: </strong>In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.</p><p><strong>Methods: </strong>We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.</p><p><strong>Results: </strong>We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.</p><p><strong>Conclusion: </strong>Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.</p>","PeriodicalId":11382,"journal":{"name":"Drug Safety","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-09-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Leveraging Large Language Models in Extracting Drug Safety Information from Prescription Drug Labels.\",\"authors\":\"Undina Gisladottir, Michael Zietz, Sophia Kivelson, Yutaro Tanaka, Gaurav Sirdeshmukh, Kathleen LaRow Brown, Nicholas P Tatonetti\",\"doi\":\"10.1007/s40264-025-01594-x\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Introduction: </strong>Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.</p><p><strong>Objective: </strong>In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.</p><p><strong>Methods: </strong>We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.</p><p><strong>Results: </strong>We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.</p><p><strong>Conclusion: </strong>Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.</p>\",\"PeriodicalId\":11382,\"journal\":{\"name\":\"Drug Safety\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-09-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Drug Safety\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1007/s40264-025-01594-x\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"PHARMACOLOGY & PHARMACY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Drug Safety","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s40264-025-01594-x","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
Leveraging Large Language Models in Extracting Drug Safety Information from Prescription Drug Labels.
Introduction: Adverse drug reactions (ADRs), including those resulting from drug interactions, remain a leading cause of morbidity and mortality. Structured product labels (SPLs) serve as a primary source for drug safety information. Having machine-readable product labels, including adverse reactions (ARs) and drug interactions, readily available would allow researchers to streamline medication safety studies. However, extracting this information is complex and requires the use of natural language processing (NLP) methods.
Objective: In this study, we explored the application of generative language models in the extraction of drug safety information from SPLs.
Methods: We compared multiple generative LLMs (GPT, Llama, and Mixtral) to two baseline methods in the task of extracting adverse reactions (ARs) from SPLs. We explored various factors, such as prompting strategies and term complexity, that impact the performance of these models in the extraction of ARs. Finally, we explored the generative models' capacity to extract drug interactions from a separate section of SPLs without additional fine-tuning or training, demonstrating their flexibility and adaptability for information retrieval.
Results: We found that generative language models, specifically GPT-4, are able to match or exceed the performance of previous state-of-the-art models without additional training or fine-tuning. Additionally, we found that the specific SPL section, surrounding context, and complexity of the AR term impacted the extraction performance. Finally, we demonstrated the generalizability of these models by applying them to a separate task of extracting drug names from the drug interaction section where curated training data are not available.
Conclusion: Generative language models demonstrate significant potential for automating drug safety information extraction from SPLs, offering a promising avenue for improving post-market surveillance and reducing ADRs. Future work should focus on refining prompting strategies and expanding the models' capabilities to handle increasingly complex and nuanced drug safety information.
期刊介绍:
Drug Safety is the official journal of the International Society of Pharmacovigilance. The journal includes:
Overviews of contentious or emerging issues.
Comprehensive narrative reviews that provide an authoritative source of information on epidemiology, clinical features, prevention and management of adverse effects of individual drugs and drug classes.
In-depth benefit-risk assessment of adverse effect and efficacy data for a drug in a defined therapeutic area.
Systematic reviews (with or without meta-analyses) that collate empirical evidence to answer a specific research question, using explicit, systematic methods as outlined by the PRISMA statement.
Original research articles reporting the results of well-designed studies in disciplines such as pharmacoepidemiology, pharmacovigilance, pharmacology and toxicology, and pharmacogenomics.
Editorials and commentaries on topical issues.
Additional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal’s content. In addition, articles published in Drug Safety Drugs may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.