IF 2.2 4区 医学 Q2 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Kelli DePriest, John Feher, Kailen Gore, LaShawn Glasgow, Clint Grant, Peter Holtgrave, Karen Hacker, Robert Chew
{"title":"Content Analysis of Social Determinants of Health Accelerator Plans Using Artificial Intelligence: A Use Case for Public Health Practitioners.","authors":"Kelli DePriest, John Feher, Kailen Gore, LaShawn Glasgow, Clint Grant, Peter Holtgrave, Karen Hacker, Robert Chew","doi":"10.1097/PHH.0000000000002148","DOIUrl":null,"url":null,"abstract":"<p><strong>Context: </strong>Public health practice involves the development of reports and plans, including funding progress reports, strategic plans, and community needs assessments. These documents are valuable data sources for program monitoring and evaluation. However, practitioners rarely have the bandwidth to thoroughly and rapidly review large amounts of primarily qualitative data to support real-time and continuous program improvement. Systematically examining and categorizing qualitative data through content analysis is labor-intensive. Large language models (LLMs), a type of generative artificial intelligence (AI) focused on language-based tasks, hold promise for expediting content analysis of public health documents, which, in turn, could facilitate continuous program improvement.</p><p><strong>Objectives: </strong>To explore the feasibility and potential of using LLMs to expedite content analysis of real-world public health documents. The focus was on comparing semiautomated outputs from GPT-4o with human outputs for abstracting and synthesizing information from health improvement plans.</p><p><strong>Design: </strong>Our study team conducted a content analysis of 4 publicly available community health improvement plans and compared the results with GPT-4o's performance on 20 data elements. We also assessed the resources required for both methods, including time spent on prompt engineering and error correction.</p><p><strong>Main outcome measures: </strong>Accuracy of data abstraction and time required.</p><p><strong>Results: </strong>GPT-4o demonstrated abstraction accuracy of 79% (n = 17 errors) compared to 94% accuracy by the study team for individual plans, with 8 instances of falsified data. Out of the 18 synthesis data elements, GPT-4o made 9 errors, demonstrating an accuracy of 50%. On average, GPT-4o abstraction required fewer hours than study team abstraction, but resource savings diminished when accounting for time for developing prompts and identifying/correcting errors.</p><p><strong>Conclusions: </strong>Public health professionals who explore the use of generative AI tools should approach the method with cautious curiosity and consider the potential tradeoffs between resource savings and data accuracy.</p>","PeriodicalId":47855,"journal":{"name":"Journal of Public Health Management and Practice","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Public Health Management and Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/PHH.0000000000002148","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

摘要

背景:公共卫生实践涉及报告和计划的制定,包括筹资进度报告、战略计划和社区需求评估。这些文件是计划监督和评估的宝贵数据来源。然而,从业人员很少有足够的时间来彻底、快速地审查大量主要是定性的数据,以支持实时和持续的项目改进。通过内容分析对定性数据进行系统检查和分类是一项劳动密集型工作。大型语言模型(LLMs)是一种生成式人工智能(AI),专注于基于语言的任务,有望加快公共卫生文件的内容分析,进而促进项目的持续改进:探索使用 LLMs 加快现实世界公共卫生文件内容分析的可行性和潜力。重点是比较 GPT-4o 的半自动化输出结果与人工输出结果,以便从健康改进计划中抽取和综合信息:设计:我们的研究团队对 4 个公开的社区健康改善计划进行了内容分析,并将分析结果与 GPT-4o 在 20 个数据元素上的表现进行了比较。我们还评估了两种方法所需的资源,包括用于及时工程和纠错的时间:结果:结果:GPT-4o 的抽取准确率为 79%(n = 17 个错误),而研究团队对单个计划的抽取准确率为 94%,其中有 8 例伪造数据。在 18 个综合数据元素中,GPT-4o 出错 9 次,准确率为 50%。平均而言,GPT-4o 抽取数据所需的时间比研究小组抽取数据所需的时间少,但如果考虑到制定提示和识别/纠正错误的时间,节省的资源就会减少:结论:探索使用生成式人工智能工具的公共卫生专业人员应以谨慎好奇的态度对待这种方法,并考虑节省资源与数据准确性之间的潜在权衡。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Content Analysis of Social Determinants of Health Accelerator Plans Using Artificial Intelligence: A Use Case for Public Health Practitioners.

Context: Public health practice involves the development of reports and plans, including funding progress reports, strategic plans, and community needs assessments. These documents are valuable data sources for program monitoring and evaluation. However, practitioners rarely have the bandwidth to thoroughly and rapidly review large amounts of primarily qualitative data to support real-time and continuous program improvement. Systematically examining and categorizing qualitative data through content analysis is labor-intensive. Large language models (LLMs), a type of generative artificial intelligence (AI) focused on language-based tasks, hold promise for expediting content analysis of public health documents, which, in turn, could facilitate continuous program improvement.

Objectives: To explore the feasibility and potential of using LLMs to expedite content analysis of real-world public health documents. The focus was on comparing semiautomated outputs from GPT-4o with human outputs for abstracting and synthesizing information from health improvement plans.

Design: Our study team conducted a content analysis of 4 publicly available community health improvement plans and compared the results with GPT-4o's performance on 20 data elements. We also assessed the resources required for both methods, including time spent on prompt engineering and error correction.

Main outcome measures: Accuracy of data abstraction and time required.

Results: GPT-4o demonstrated abstraction accuracy of 79% (n = 17 errors) compared to 94% accuracy by the study team for individual plans, with 8 instances of falsified data. Out of the 18 synthesis data elements, GPT-4o made 9 errors, demonstrating an accuracy of 50%. On average, GPT-4o abstraction required fewer hours than study team abstraction, but resource savings diminished when accounting for time for developing prompts and identifying/correcting errors.

Conclusions: Public health professionals who explore the use of generative AI tools should approach the method with cautious curiosity and consider the potential tradeoffs between resource savings and data accuracy.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Public Health Management and Practice
Journal of Public Health Management and Practice PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH-
CiteScore
3.40
自引率
9.10%
发文量
287
期刊介绍: Journal of Public Health Management and Practice publishes articles which focus on evidence based public health practice and research. The journal is a bi-monthly peer-reviewed publication guided by a multidisciplinary editorial board of administrators, practitioners and scientists. Journal of Public Health Management and Practice publishes in a wide range of population health topics including research to practice; emergency preparedness; bioterrorism; infectious disease surveillance; environmental health; community health assessment, chronic disease prevention and health promotion, and academic-practice linkages.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信