Kelli DePriest, John Feher, Kailen Gore, LaShawn Glasgow, Clint Grant, Peter Holtgrave, Karen Hacker, Robert Chew
{"title":"Content Analysis of Social Determinants of Health Accelerator Plans Using Artificial Intelligence: A Use Case for Public Health Practitioners.","authors":"Kelli DePriest, John Feher, Kailen Gore, LaShawn Glasgow, Clint Grant, Peter Holtgrave, Karen Hacker, Robert Chew","doi":"10.1097/PHH.0000000000002148","DOIUrl":null,"url":null,"abstract":"<p><strong>Context: </strong>Public health practice involves the development of reports and plans, including funding progress reports, strategic plans, and community needs assessments. These documents are valuable data sources for program monitoring and evaluation. However, practitioners rarely have the bandwidth to thoroughly and rapidly review large amounts of primarily qualitative data to support real-time and continuous program improvement. Systematically examining and categorizing qualitative data through content analysis is labor-intensive. Large language models (LLMs), a type of generative artificial intelligence (AI) focused on language-based tasks, hold promise for expediting content analysis of public health documents, which, in turn, could facilitate continuous program improvement.</p><p><strong>Objectives: </strong>To explore the feasibility and potential of using LLMs to expedite content analysis of real-world public health documents. The focus was on comparing semiautomated outputs from GPT-4o with human outputs for abstracting and synthesizing information from health improvement plans.</p><p><strong>Design: </strong>Our study team conducted a content analysis of 4 publicly available community health improvement plans and compared the results with GPT-4o's performance on 20 data elements. We also assessed the resources required for both methods, including time spent on prompt engineering and error correction.</p><p><strong>Main outcome measures: </strong>Accuracy of data abstraction and time required.</p><p><strong>Results: </strong>GPT-4o demonstrated abstraction accuracy of 79% (n = 17 errors) compared to 94% accuracy by the study team for individual plans, with 8 instances of falsified data. Out of the 18 synthesis data elements, GPT-4o made 9 errors, demonstrating an accuracy of 50%. On average, GPT-4o abstraction required fewer hours than study team abstraction, but resource savings diminished when accounting for time for developing prompts and identifying/correcting errors.</p><p><strong>Conclusions: </strong>Public health professionals who explore the use of generative AI tools should approach the method with cautious curiosity and consider the potential tradeoffs between resource savings and data accuracy.</p>","PeriodicalId":47855,"journal":{"name":"Journal of Public Health Management and Practice","volume":" ","pages":""},"PeriodicalIF":2.2000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Public Health Management and Practice","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/PHH.0000000000002148","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
Content Analysis of Social Determinants of Health Accelerator Plans Using Artificial Intelligence: A Use Case for Public Health Practitioners.
Context: Public health practice involves the development of reports and plans, including funding progress reports, strategic plans, and community needs assessments. These documents are valuable data sources for program monitoring and evaluation. However, practitioners rarely have the bandwidth to thoroughly and rapidly review large amounts of primarily qualitative data to support real-time and continuous program improvement. Systematically examining and categorizing qualitative data through content analysis is labor-intensive. Large language models (LLMs), a type of generative artificial intelligence (AI) focused on language-based tasks, hold promise for expediting content analysis of public health documents, which, in turn, could facilitate continuous program improvement.
Objectives: To explore the feasibility and potential of using LLMs to expedite content analysis of real-world public health documents. The focus was on comparing semiautomated outputs from GPT-4o with human outputs for abstracting and synthesizing information from health improvement plans.
Design: Our study team conducted a content analysis of 4 publicly available community health improvement plans and compared the results with GPT-4o's performance on 20 data elements. We also assessed the resources required for both methods, including time spent on prompt engineering and error correction.
Main outcome measures: Accuracy of data abstraction and time required.
Results: GPT-4o demonstrated abstraction accuracy of 79% (n = 17 errors) compared to 94% accuracy by the study team for individual plans, with 8 instances of falsified data. Out of the 18 synthesis data elements, GPT-4o made 9 errors, demonstrating an accuracy of 50%. On average, GPT-4o abstraction required fewer hours than study team abstraction, but resource savings diminished when accounting for time for developing prompts and identifying/correcting errors.
Conclusions: Public health professionals who explore the use of generative AI tools should approach the method with cautious curiosity and consider the potential tradeoffs between resource savings and data accuracy.
期刊介绍:
Journal of Public Health Management and Practice publishes articles which focus on evidence based public health practice and research. The journal is a bi-monthly peer-reviewed publication guided by a multidisciplinary editorial board of administrators, practitioners and scientists. Journal of Public Health Management and Practice publishes in a wide range of population health topics including research to practice; emergency preparedness; bioterrorism; infectious disease surveillance; environmental health; community health assessment, chronic disease prevention and health promotion, and academic-practice linkages.