An AI assistant for critically assessing and synthesizing clusters of journal articles

Louis Anthony Cox Jr.
{"title":"An AI assistant for critically assessing and synthesizing clusters of journal articles","authors":"Louis Anthony Cox Jr.","doi":"10.1016/j.gloepi.2025.100207","DOIUrl":null,"url":null,"abstract":"<div><div>Current large language models (LLMs) face significant challenges in attempting to synthesize and critically assess conflicting causal claims in scientific literature about exposure-associated health effects. This paper examines the design and performance of AIA2, an experimental AI system (freely available at <span><span>http://cloud.cox-associates.com/</span><svg><path></path></svg></span>) designed to help explore and illustrate potential applications of current AI in assisting analysis of clusters of related scientific articles, focusing on causal claims in complex domains such as epidemiology, toxicology, and risk analysis. Building on an earlier AI assistant, AIA1, which critically reviewed causal claims in individual papers, AIA2 advances the approach by systematically comparing multiple studies to identify areas of agreement and disagreement, suggest explanations for differences in conclusions, flag methodological gaps and inconsistencies, synthesize and summarize well-supported conclusions despite conflicts, and propose recommendations to help resolve knowledge gaps. We illustrate these capabilities with a case study of formaldehyde exposure and leukemia using a cluster of four papers that feature very different approaches and partly conflicting conclusions. AIA2 successfully identifies major points of agreement and contention, discusses the robustness of the evidence for causal claims, and recommends future research directions to address current uncertainties. AIA2's outputs suggest that current AI can offer a promising, practicable approach to AI-assisted review of clusters of papers, promoting methodological rigor, thoroughness, and transparency in review and synthesis, notwithstanding current limitations of LLMs. We discuss the implications of AI-assisted literature review systems for improving evidence-based decision-making, resolving conflicting scientific claims, and promoting rigor and reproducibility in causal research and health risk analysis.</div></div>","PeriodicalId":36311,"journal":{"name":"Global Epidemiology","volume":"10 ","pages":"Article 100207"},"PeriodicalIF":0.0000,"publicationDate":"2025-05-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Global Epidemiology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2590113325000252","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Current large language models (LLMs) face significant challenges in attempting to synthesize and critically assess conflicting causal claims in scientific literature about exposure-associated health effects. This paper examines the design and performance of AIA2, an experimental AI system (freely available at http://cloud.cox-associates.com/) designed to help explore and illustrate potential applications of current AI in assisting analysis of clusters of related scientific articles, focusing on causal claims in complex domains such as epidemiology, toxicology, and risk analysis. Building on an earlier AI assistant, AIA1, which critically reviewed causal claims in individual papers, AIA2 advances the approach by systematically comparing multiple studies to identify areas of agreement and disagreement, suggest explanations for differences in conclusions, flag methodological gaps and inconsistencies, synthesize and summarize well-supported conclusions despite conflicts, and propose recommendations to help resolve knowledge gaps. We illustrate these capabilities with a case study of formaldehyde exposure and leukemia using a cluster of four papers that feature very different approaches and partly conflicting conclusions. AIA2 successfully identifies major points of agreement and contention, discusses the robustness of the evidence for causal claims, and recommends future research directions to address current uncertainties. AIA2's outputs suggest that current AI can offer a promising, practicable approach to AI-assisted review of clusters of papers, promoting methodological rigor, thoroughness, and transparency in review and synthesis, notwithstanding current limitations of LLMs. We discuss the implications of AI-assisted literature review systems for improving evidence-based decision-making, resolving conflicting scientific claims, and promoting rigor and reproducibility in causal research and health risk analysis.
用于批判性地评估和综合期刊文章集群的人工智能助手
目前的大型语言模型(llm)在试图综合和批判性评估科学文献中关于暴露相关健康影响的相互矛盾的因果关系主张方面面临重大挑战。本文研究了AIA2的设计和性能,AIA2是一个实验性人工智能系统(可在http://cloud.cox-associates.com/免费获得),旨在帮助探索和说明当前人工智能在辅助分析相关科学文章集群方面的潜在应用,重点关注流行病学、毒理学和风险分析等复杂领域的因果关系主张。在早期人工智能助手AIA1的基础上,AIA2通过系统地比较多个研究来确定一致和不一致的领域,提出结论差异的解释,标记方法上的差距和不一致,综合和总结有充分支持的结论,尽管存在冲突,并提出建议来帮助解决知识差距。我们用甲醛暴露和白血病的案例研究说明了这些能力,使用四篇论文的集群,具有非常不同的方法和部分矛盾的结论。AIA2成功地识别了主要的共识和争论点,讨论了因果主张证据的稳健性,并建议未来的研究方向以解决当前的不确定性。AIA2的产出表明,尽管目前法学硕士存在局限性,但目前的人工智能可以提供一种有前途的、可行的方法来辅助人工智能对论文群的审查,促进审查和综合的方法严谨性、彻全性和透明度。我们讨论了人工智能辅助文献综述系统在改善循证决策、解决科学主张冲突、促进因果研究和健康风险分析的严谨性和可重复性方面的意义。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Global Epidemiology
Global Epidemiology Medicine-Infectious Diseases
CiteScore
5.00
自引率
0.00%
发文量
22
审稿时长
39 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信