Rachael L Fleurence, Dalia Dawoud, Jiang Bian, Mitchell K Higashi, Xiaoyan Wang, Hua Xu, Jagpreet Chhatwal, Turgay Ayer
{"title":"ELEVATE-GenAI:在卫生经济学和结果研究中使用大型语言模型的报告指南:ISPOR生成人工智能报告工作组。","authors":"Rachael L Fleurence, Dalia Dawoud, Jiang Bian, Mitchell K Higashi, Xiaoyan Wang, Hua Xu, Jagpreet Chhatwal, Turgay Ayer","doi":"10.1016/j.jval.2025.06.018","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>Generative artificial intelligence (AI), particularly large language models (LLMs), holds significant promise for health economics and outcomes research (HEOR). However, standardized reporting guidance for LLM-assisted research is lacking. This article introduces the ELEVATE-GenAI framework and checklist-reporting guidelines specifically designed for HEOR studies involving LLMs.</p><p><strong>Methods: </strong>The framework was developed through a targeted literature review of existing reporting guidelines, AI evaluation frameworks, and expert input from the ISPOR Working Group on Generative AI. It comprises 10 domains-including model characteristics, accuracy, reproducibility, and fairness and bias. The accompanying checklist translates the framework into actionable reporting items. To illustrate its use, the framework was applied to 2 published HEOR studies: one focused on a systematic literature review tasks and the other on economic modeling.</p><p><strong>Results: </strong>The ELEVATE-GenAI framework offers a comprehensive structure for reporting LLM-assisted HEOR research, while the checklist facilitates practical implementation. Its application to the 2 case studies demonstrates its relevance and usability across different HEOR contexts.</p><p><strong>Conclusions: </strong>Although the framework provides robust reporting guidance, further empirical testing is needed to assess its validity, completeness, usability, and generalizability across diverse HEOR use cases. The ELEVATE-GenAI framework and checklist address a critical gap by offering structured guidance for transparent, accurate, and reproducible reporting of LLM-assisted HEOR research. Future work will focus on extensive testing and validation to support broader adoption and refinement.</p>","PeriodicalId":23508,"journal":{"name":"Value in Health","volume":" ","pages":""},"PeriodicalIF":6.0000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"ELEVATE-GenAI: Reporting Guidelines for the Use of Large Language Models in Health Economics and Outcomes Research: An ISPOR Working Group Report.\",\"authors\":\"Rachael L Fleurence, Dalia Dawoud, Jiang Bian, Mitchell K Higashi, Xiaoyan Wang, Hua Xu, Jagpreet Chhatwal, Turgay Ayer\",\"doi\":\"10.1016/j.jval.2025.06.018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>Generative artificial intelligence (AI), particularly large language models (LLMs), holds significant promise for health economics and outcomes research (HEOR). However, standardized reporting guidance for LLM-assisted research is lacking. This article introduces the ELEVATE-GenAI framework and checklist-reporting guidelines specifically designed for HEOR studies involving LLMs.</p><p><strong>Methods: </strong>The framework was developed through a targeted literature review of existing reporting guidelines, AI evaluation frameworks, and expert input from the ISPOR Working Group on Generative AI. It comprises 10 domains-including model characteristics, accuracy, reproducibility, and fairness and bias. The accompanying checklist translates the framework into actionable reporting items. To illustrate its use, the framework was applied to 2 published HEOR studies: one focused on a systematic literature review tasks and the other on economic modeling.</p><p><strong>Results: </strong>The ELEVATE-GenAI framework offers a comprehensive structure for reporting LLM-assisted HEOR research, while the checklist facilitates practical implementation. Its application to the 2 case studies demonstrates its relevance and usability across different HEOR contexts.</p><p><strong>Conclusions: </strong>Although the framework provides robust reporting guidance, further empirical testing is needed to assess its validity, completeness, usability, and generalizability across diverse HEOR use cases. The ELEVATE-GenAI framework and checklist address a critical gap by offering structured guidance for transparent, accurate, and reproducible reporting of LLM-assisted HEOR research. Future work will focus on extensive testing and validation to support broader adoption and refinement.</p>\",\"PeriodicalId\":23508,\"journal\":{\"name\":\"Value in Health\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-07-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Value in Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jval.2025.06.018\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Value in Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jval.2025.06.018","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}
ELEVATE-GenAI: Reporting Guidelines for the Use of Large Language Models in Health Economics and Outcomes Research: An ISPOR Working Group Report.
Objectives: Generative artificial intelligence (AI), particularly large language models (LLMs), holds significant promise for health economics and outcomes research (HEOR). However, standardized reporting guidance for LLM-assisted research is lacking. This article introduces the ELEVATE-GenAI framework and checklist-reporting guidelines specifically designed for HEOR studies involving LLMs.
Methods: The framework was developed through a targeted literature review of existing reporting guidelines, AI evaluation frameworks, and expert input from the ISPOR Working Group on Generative AI. It comprises 10 domains-including model characteristics, accuracy, reproducibility, and fairness and bias. The accompanying checklist translates the framework into actionable reporting items. To illustrate its use, the framework was applied to 2 published HEOR studies: one focused on a systematic literature review tasks and the other on economic modeling.
Results: The ELEVATE-GenAI framework offers a comprehensive structure for reporting LLM-assisted HEOR research, while the checklist facilitates practical implementation. Its application to the 2 case studies demonstrates its relevance and usability across different HEOR contexts.
Conclusions: Although the framework provides robust reporting guidance, further empirical testing is needed to assess its validity, completeness, usability, and generalizability across diverse HEOR use cases. The ELEVATE-GenAI framework and checklist address a critical gap by offering structured guidance for transparent, accurate, and reproducible reporting of LLM-assisted HEOR research. Future work will focus on extensive testing and validation to support broader adoption and refinement.
期刊介绍:
Value in Health contains original research articles for pharmacoeconomics, health economics, and outcomes research (clinical, economic, and patient-reported outcomes/preference-based research), as well as conceptual and health policy articles that provide valuable information for health care decision-makers as well as the research community. As the official journal of ISPOR, Value in Health provides a forum for researchers, as well as health care decision-makers to translate outcomes research into health care decisions.