ELEVATE-GenAI: Reporting Guidelines for the Use of Large Language Models in Health Economics and Outcomes Research: An ISPOR Working Group Report.

IF 6 | Zone 2 (Medicine) | Q1 ECONOMICS

Value in Health Pub Date : 2025-07-11 DOI:10.1016/j.jval.2025.06.018

Rachael L Fleurence, Dalia Dawoud, Jiang Bian, Mitchell K Higashi, Xiaoyan Wang, Hua Xu, Jagpreet Chhatwal, Turgay Ayer


Citations: 0

Abstract



ELEVATE-GenAI: Reporting Guidelines for the Use of Large Language Models in Health Economics and Outcomes Research: An ISPOR Working Group Report.

Objectives: Generative artificial intelligence (AI), particularly large language models (LLMs), holds significant promise for health economics and outcomes research (HEOR). However, standardized reporting guidance for LLM-assisted research is lacking. This article introduces the ELEVATE-GenAI framework and checklist: reporting guidelines specifically designed for HEOR studies involving LLMs.

Methods: The framework was developed through a targeted literature review of existing reporting guidelines, AI evaluation frameworks, and expert input from the ISPOR Working Group on Generative AI. It comprises 10 domains, including model characteristics, accuracy, reproducibility, and fairness and bias. The accompanying checklist translates the framework into actionable reporting items. To illustrate its use, the framework was applied to 2 published HEOR studies: one focused on systematic literature review tasks and the other on economic modeling.

Results: The ELEVATE-GenAI framework offers a comprehensive structure for reporting LLM-assisted HEOR research, while the checklist facilitates practical implementation. Its application to the 2 case studies demonstrates its relevance and usability across different HEOR contexts.

Conclusions: Although the framework provides robust reporting guidance, further empirical testing is needed to assess its validity, completeness, usability, and generalizability across diverse HEOR use cases. The ELEVATE-GenAI framework and checklist address a critical gap by offering structured guidance for transparent, accurate, and reproducible reporting of LLM-assisted HEOR research. Future work will focus on extensive testing and validation to support broader adoption and refinement.


Source journal

Value in Health (Medicine: Health Care)

CiteScore: 6.90

Self-citation rate: 6.70%

Articles published: 3064

Review time: 3-8 weeks

About the journal: Value in Health contains original research articles on pharmacoeconomics, health economics, and outcomes research (clinical, economic, and patient-reported outcomes/preference-based research), as well as conceptual and health policy articles that provide valuable information for health care decision-makers and the research community. As the official journal of ISPOR, Value in Health provides a forum for researchers and health care decision-makers to translate outcomes research into health care decisions.