“人工智能统计学家”：利用生成式人工智能选择合适的模型并执行网络元分析。

IF 6 2区医学 Q1 ECONOMICS

Value in Health Pub Date : 2025-08-12 DOI:10.1016/j.jval.2025.08.001

Tim Reason, Yunchou Wu, Cheryl Jones, Emma Benbow, Kasper Johannesen, Bill Malcolm

{"title":"“人工智能统计学家”：利用生成式人工智能选择合适的模型并执行网络元分析。","authors":"Tim Reason, Yunchou Wu, Cheryl Jones, Emma Benbow, Kasper Johannesen, Bill Malcolm","doi":"10.1016/j.jval.2025.08.001","DOIUrl":null,"url":null,"abstract":"Objectives: This exploratory study aimed to develop a large language model (LLM)-based process to automate components of network meta-analysis (NMA), including model selection, analysis, output evaluation, and results interpretation. Automating these tasks with LLMs can enhance efficiency, consistency, and scalability in health economics and outcomes research, while ensuring that analyses adhere to established guidelines required by health technology assessment agencies. Improvements in efficiency and scalability may potentially become relevant as the European Union Health Technology Assessment Regulation comes into force, given anticipated analysis requirements and timelines.Methods: Using Claude 3.5 Sonnet (V2), a process was designed to automate statistical model selection, NMA output evaluation, and results interpretation based on an \"analysis-ready\" data set. Validation was assessed by replicating examples from the National Institute for Health and Care Excellence Technical Support Document (TSD2), replicating results of non-Decision Support Unit-published NMAs, and generating comprehensive outputs (eg, heterogeneity, inconsistency, and convergence).Results: The automated LLM-based process produced accurate results. Compared with TSD2 examples, differences were minimal, within expectations (given differences in sampling frameworks used), and comparable to those observed between estimates produced by the R vignettes against TSD2. Similar consistency was noted for non-Decision Support Unit-published NMA examples. Additionally, the LLM process generated and interpreted comprehensive NMA outputs.Conclusions: This exploratory study demonstrates the feasibility of LLMs to automate key components of NMAs, determining the requisite NMA framework based only on input data. Further exploring these capabilities could clarify their role in streamlining NMA workflows.","PeriodicalId":23508,"journal":{"name":"Value in Health","volume":" ","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-08-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The \\\"Artificial Intelligence Statistician\\\": Utilizing Generative Artificial Intelligence to Select an Appropriate Model and Execute Network Meta-Analyses.\",\"authors\":\"Tim Reason, Yunchou Wu, Cheryl Jones, Emma Benbow, Kasper Johannesen, Bill Malcolm\",\"doi\":\"10.1016/j.jval.2025.08.001\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objectives: This exploratory study aimed to develop a large language model (LLM)-based process to automate components of network meta-analysis (NMA), including model selection, analysis, output evaluation, and results interpretation. Automating these tasks with LLMs can enhance efficiency, consistency, and scalability in health economics and outcomes research, while ensuring that analyses adhere to established guidelines required by health technology assessment agencies. Improvements in efficiency and scalability may potentially become relevant as the European Union Health Technology Assessment Regulation comes into force, given anticipated analysis requirements and timelines.Methods: Using Claude 3.5 Sonnet (V2), a process was designed to automate statistical model selection, NMA output evaluation, and results interpretation based on an \\\"analysis-ready\\\" data set. Validation was assessed by replicating examples from the National Institute for Health and Care Excellence Technical Support Document (TSD2), replicating results of non-Decision Support Unit-published NMAs, and generating comprehensive outputs (eg, heterogeneity, inconsistency, and convergence).Results: The automated LLM-based process produced accurate results. Compared with TSD2 examples, differences were minimal, within expectations (given differences in sampling frameworks used), and comparable to those observed between estimates produced by the R vignettes against TSD2. Similar consistency was noted for non-Decision Support Unit-published NMA examples. Additionally, the LLM process generated and interpreted comprehensive NMA outputs.Conclusions: This exploratory study demonstrates the feasibility of LLMs to automate key components of NMAs, determining the requisite NMA framework based only on input data. Further exploring these capabilities could clarify their role in streamlining NMA workflows.\",\"PeriodicalId\":23508,\"journal\":{\"name\":\"Value in Health\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-08-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Value in Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jval.2025.08.001\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Value in Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jval.2025.08.001","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

目的：本探索性研究旨在开发一个基于大型语言模型（LLM）的过程，以自动化网络元分析（NMA）的组成部分，包括模型选择、分析、输出评估和结果解释。用法学硕士自动化这些任务可以提高卫生经济学和成果研究的效率、一致性和可扩展性，同时确保分析符合卫生技术评估机构所要求的既定指导方针。考虑到预期的分析要求和时间表，随着欧盟卫生技术评估条例（HTAR）生效，效率和可扩展性的改进可能会变得相关。方法：使用Claude 3.5 Sonnet [V2]，设计了一个基于“分析就绪”数据集的自动统计模型选择、NMA输出评估和结果解释的过程。通过复制国家健康与护理卓越研究所（NICE）技术支持文件（TSD2）中的示例来评估有效性；非dsu发表的nma的复制结果；并产生全面的输出（例如，异质性，不一致性，收敛性）。结果：基于llm的自动化过程产生了准确的结果。与tts2的例子相比，差异很小，在预期范围内（考虑到所使用的抽样框架的差异），并且与针对tts2的R小插曲产生的估计之间观察到的差异相当。对于非dsu发布的NMA示例，也注意到类似的一致性。此外，法学硕士过程生成并解释了全面的NMA输出。结论：本探索性研究证明了llm自动化NMA关键组件的可行性，仅根据输入数据确定必要的NMA框架。进一步探索这些功能可以澄清它们在简化NMA工作流程中的作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

The "Artificial Intelligence Statistician": Utilizing Generative Artificial Intelligence to Select an Appropriate Model and Execute Network Meta-Analyses.

Objectives: This exploratory study aimed to develop a large language model (LLM)-based process to automate components of network meta-analysis (NMA), including model selection, analysis, output evaluation, and results interpretation. Automating these tasks with LLMs can enhance efficiency, consistency, and scalability in health economics and outcomes research, while ensuring that analyses adhere to established guidelines required by health technology assessment agencies. Improvements in efficiency and scalability may potentially become relevant as the European Union Health Technology Assessment Regulation comes into force, given anticipated analysis requirements and timelines.

Methods: Using Claude 3.5 Sonnet (V2), a process was designed to automate statistical model selection, NMA output evaluation, and results interpretation based on an "analysis-ready" data set. Validation was assessed by replicating examples from the National Institute for Health and Care Excellence Technical Support Document (TSD2), replicating results of non-Decision Support Unit-published NMAs, and generating comprehensive outputs (eg, heterogeneity, inconsistency, and convergence).

Results: The automated LLM-based process produced accurate results. Compared with TSD2 examples, differences were minimal, within expectations (given differences in sampling frameworks used), and comparable to those observed between estimates produced by the R vignettes against TSD2. Similar consistency was noted for non-Decision Support Unit-published NMA examples. Additionally, the LLM process generated and interpreted comprehensive NMA outputs.

Conclusions: This exploratory study demonstrates the feasibility of LLMs to automate key components of NMAs, determining the requisite NMA framework based only on input data. Further exploring these capabilities could clarify their role in streamlining NMA workflows.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Value in Health 医学-卫生保健

CiteScore

6.90

自引率

6.70%

发文量

3064

审稿时长

3-8 weeks

期刊介绍： Value in Health contains original research articles for pharmacoeconomics, health economics, and outcomes research (clinical, economic, and patient-reported outcomes/preference-based research), as well as conceptual and health policy articles that provide valuable information for health care decision-makers as well as the research community. As the official journal of ISPOR, Value in Health provides a forum for researchers, as well as health care decision-makers to translate outcomes research into health care decisions.