Valsci: an open-source, self-hostable literature review utility for automated large-batch scientific claim verification using large language models.

IF 3.3 3区生物学 Q2 BIOCHEMICAL RESEARCH METHODS

BMC Bioinformatics Pub Date : 2025-05-28 DOI:10.1186/s12859-025-06159-4

Brice Edelman, Jeffrey Skolnick

{"title":"Valsci: an open-source, self-hostable literature review utility for automated large-batch scientific claim verification using large language models.","authors":"Brice Edelman, Jeffrey Skolnick","doi":"10.1186/s12859-025-06159-4","DOIUrl":null,"url":null,"abstract":"Background: The exponential growth of scientific publications poses a formidable challenge for researchers seeking to validate emerging hypotheses or synthesize existing evidence. In this paper, we introduce Valsci, an open-source, self-hostable utility that automates large-batch scientific claim verification using any OpenAI-compatible large language model. Valsci unites retrieval-augmented generation with structured bibliometric scoring and chain-of-thought prompting, enabling users to efficiently search, evaluate, and summarize evidence from the Semantic Scholar database and other academic sources. Unlike conventional standalone LLMs, which often suffer from hallucinations and unreliable citations, Valsci grounds its analyses in verifiable published findings. A guided prompt-flow approach is employed to generate query expansions, retrieve relevant excerpts, and synthesize coherent, evidence-based reports.Results: Preliminary evaluations across claims from the SciFact benchmark dataset reveal that Valsci significantly outperforms base GPT-4o outputs in citation hallucination rate while maintaining a low misclassification rate. The system is highly scalable, processing hundreds of claims per hour through asynchronous parallelization.Conclusions: By providing an open and transparent platform for large-batch literature verification, Valsci substantially lowers the barrier to comprehensive evidence-based reviews and fosters a more reproducible research ecosystem.","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"140"},"PeriodicalIF":3.3000,"publicationDate":"2025-05-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12121171/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06159-4","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}

引用次数: 0

Abstract

Background: The exponential growth of scientific publications poses a formidable challenge for researchers seeking to validate emerging hypotheses or synthesize existing evidence. In this paper, we introduce Valsci, an open-source, self-hostable utility that automates large-batch scientific claim verification using any OpenAI-compatible large language model. Valsci unites retrieval-augmented generation with structured bibliometric scoring and chain-of-thought prompting, enabling users to efficiently search, evaluate, and summarize evidence from the Semantic Scholar database and other academic sources. Unlike conventional standalone LLMs, which often suffer from hallucinations and unreliable citations, Valsci grounds its analyses in verifiable published findings. A guided prompt-flow approach is employed to generate query expansions, retrieve relevant excerpts, and synthesize coherent, evidence-based reports.

Results: Preliminary evaluations across claims from the SciFact benchmark dataset reveal that Valsci significantly outperforms base GPT-4o outputs in citation hallucination rate while maintaining a low misclassification rate. The system is highly scalable, processing hundreds of claims per hour through asynchronous parallelization.

Conclusions: By providing an open and transparent platform for large-batch literature verification, Valsci substantially lowers the barrier to comprehensive evidence-based reviews and fosters a more reproducible research ecosystem.

查看原文本刊更多论文

Valsci：一个开源的、自托管的文献回顾工具，用于使用大型语言模型进行自动化的大批量科学声明验证。

背景：科学出版物的指数级增长对寻求验证新出现的假设或综合现有证据的研究人员提出了巨大的挑战。在本文中，我们介绍了Valsci，这是一个开源的、自托管的实用程序，可以使用任何与openai兼容的大型语言模型自动进行大规模科学声明验证。Valsci将检索增强生成与结构化文献计量评分和思维链提示结合起来，使用户能够有效地搜索、评估和总结来自Semantic Scholar数据库和其他学术来源的证据。与传统的独立法学硕士不同的是，法学硕士经常遭受幻觉和不可靠的引用，Valsci的分析基于可验证的已发表的研究结果。采用一种引导的提示流方法来生成查询扩展、检索相关摘要和合成连贯的、基于证据的报告。结果：对来自SciFact基准数据集的索赔的初步评估显示，Valsci在引文幻觉率方面明显优于基本gpt - 40输出，同时保持较低的误分类率。该系统具有高度可扩展性，通过异步并行化每小时处理数百个索赔。结论：通过为大规模文献验证提供一个公开透明的平台，Valsci大大降低了全面循证评价的门槛，并建立了一个更具可重复性的研究生态系统。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Bioinformatics 生物-生化研究方法

CiteScore

5.70

自引率

3.30%

发文量

506

审稿时长

4.3 months

期刊介绍： BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.