Artificial Intelligence to Automate Network Meta-Analyses: Four Case Studies to Evaluate the Potential Application of Large Language Models

Tim Reason, Emma Benbow, Julia Langham, Andy Gimblett, Sven L Klijn, Bill Malcolm

PharmacoEconomics Open (2024), pp. 205-220. Epub 10 February 2024. DOI: 10.1007/s41669-024-00476-9. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10884375/pdf/
Citations: 0
Abstract
Background: The emergence of artificial intelligence, capable of human-level performance on some tasks, presents an opportunity to revolutionise development of systematic reviews and network meta-analyses (NMAs). In this pilot study, we aim to assess use of a large-language model (LLM, Generative Pre-trained Transformer 4 [GPT-4]) to automatically extract data from publications, write an R script to conduct an NMA and interpret the results.
Methods: We considered four case studies involving binary and time-to-event outcomes in two disease areas, for which an NMA had previously been conducted manually. For each case study, a Python script was developed that communicated with the LLM via application programming interface (API) calls. The LLM was prompted to extract relevant data from publications, to create an R script to be used to run the NMA and then to produce a small report describing the analysis.
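The paper does not publish its driver script, but the workflow it describes can be sketched in outline: a Python script that sends prompts to the LLM over an API, parses the extracted data, and collects the generated R script and report. All function names, prompt wording, and the stubbed LLM response below are illustrative assumptions, not the authors' actual code; a real run would replace the stub with a call to an LLM API such as GPT-4's.

```python
# Hypothetical sketch of the pipeline described in the Methods: prompt an LLM
# to (1) extract trial data as JSON, (2) write an R script for the NMA, and
# (3) draft a short report. Prompts and names are illustrative only.
import json
from typing import Callable

EXTRACTION_PROMPT = (
    "Extract, as JSON, the trial name, treatment arms, and the number of "
    "events and patients per arm from the following publication text:\n{text}"
)

R_SCRIPT_PROMPT = (
    "Write an R script that runs a network meta-analysis "
    "on this JSON data set:\n{data}"
)

def run_pipeline(publications: list[str],
                 call_llm: Callable[[str], str]) -> dict:
    """Drive the three steps: extract data, generate R code, summarise."""
    extracted = [json.loads(call_llm(EXTRACTION_PROMPT.format(text=t)))
                 for t in publications]
    r_script = call_llm(R_SCRIPT_PROMPT.format(data=json.dumps(extracted)))
    report = call_llm(f"Briefly describe an NMA run on: {json.dumps(extracted)}")
    return {"data": extracted, "r_script": r_script, "report": report}

# Stubbed LLM so the sketch runs without an API key; in practice this would
# be an API call to the model under evaluation.
def fake_llm(prompt: str) -> str:
    if prompt.startswith("Extract"):
        return '{"trial": "TRIAL-1", "arms": {"A": [10, 50], "B": [5, 50]}}'
    return "# placeholder LLM response"

result = run_pipeline(["<publication text>"], fake_llm)
print(result["data"][0]["trial"])  # -> TRIAL-1
```

Separating the orchestration (`run_pipeline`) from the model call also makes it easy to rerun the same case study many times, as the authors did across 20 runs, and to swap in different models.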
Results: The LLM had a > 99% success rate of accurately extracting data across 20 runs for each case study and could generate R scripts that could be run end-to-end without human input. It also produced good quality reports describing the disease area, analysis conducted, results obtained and a correct interpretation of the results.
Conclusions: This study provides a promising indication of the feasibility of using current generation LLMs to automate data extraction, code generation and NMA result interpretation, which could result in significant time savings and reduce human error. This is provided that routine technical checks are performed, as recommended for human-conducted analyses. Whilst not currently 100% consistent, LLMs are likely to improve with time.
About the journal
PharmacoEconomics - Open focuses on applied research on the economic implications and health outcomes associated with drugs, devices and other healthcare interventions. The journal includes, but is not limited to, the following research areas:

- Economic analysis of healthcare interventions
- Health outcomes research
- Cost-of-illness studies
- Quality-of-life studies

Additional digital features (including animated abstracts, video abstracts, slide decks, audio slides, instructional videos, infographics, podcasts and animations) can be published with articles; these are designed to increase the visibility, readership and educational value of the journal's content. In addition, articles published in PharmacoEconomics - Open may be accompanied by plain language summaries to assist readers who have some knowledge of, but not in-depth expertise in, the area to understand important medical advances.

All manuscripts are subject to peer review by international experts. Letters to the Editor are welcomed and will be considered for publication.