一个基于语言模型的大型工具,用于识别苯、钴和阿斯巴甜致癌性研究中与工业的关系。

IF 5.3 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Nathan L DeBono, Vanessa Amar, Hardy Hardy, Mary K Schubauer-Berigan, Derek Ruths, Nicholas B King
{"title":"一个基于语言模型的大型工具,用于识别苯、钴和阿斯巴甜致癌性研究中与工业的关系。","authors":"Nathan L DeBono, Vanessa Amar, Hardy Hardy, Mary K Schubauer-Berigan, Derek Ruths, Nicholas B King","doi":"10.1186/s12940-025-01223-1","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Industry-funded research poses a threat to the validity of scientific inference on carcinogenic hazards. Scientists require tools to better identify and characterize industry sponsored research across bodies of evidence to reduce the possible influence of industry bias in evidence synthesis reviews. We applied a novel large language model (LLM)-based tool named InfluenceMapper to demonstrate and evaluate its performance in identifying relationships to industry in research on the carcinogenicity of benzene, cobalt, and aspartame.</p><p><strong>Methods: </strong>All epidemiological, animal cancer, and mechanistic studies included in systematic reviews on the carcinogenicity of the three agents by the IARC Monographs programme. Selected agents were recently evaluated by the Monographs and are of commercial interest by major industries. InfluenceMapper extracted disclosed entities in study publications and classified up to 40 possible disclosed relationship types between each entity and the study and between each entity and author. A human classified entities as 'industry or industry-funded' and assessed relationships with industry for potential conflicts of interest. Positive predictive values described the extent of true positive relationships identified by InfluenceMapper compared to human assessment.</p><p><strong>Results: </strong>Analyses included 2,046 studies for all three agents. We identified 320 disclosed industry or industry-funded entities from InfluenceMapper output that were involved in 770 distinct study-entity and author-entity relationships. For each agent, between 4 and 8% of studies disclosed funding by industry and 1-4% of studies had at least one author who disclosed receiving industry funding directly. Industry trade associations for all three agents funded 22 studies published in 16 journals over a 37-year span. Aside from funding, the most prevalent disclosed relationships with industry were receiving data, holding employment, paid consulting, and providing expert testimony. Positive predictive values were excellent (≥ 98%) for study-entity relationships but declined for relationships with individual authors.</p><p><strong>Conclusions: </strong>LLM-based tools can significantly expedite and bolster the detection of disclosed conflicts of interest from industry sponsored research in cancer prevention. Possible use cases include facilitating the assessment of bias from industry studies in evidence synthesis reviews and alerting scientists to the influence of industry on scientific inference. Persistent challenges in ascertaining conflicts of interest underscore the urgent need for standardized, transparent, and enforceable disclosures in biomedical journals.</p>","PeriodicalId":11686,"journal":{"name":"Environmental Health","volume":"24 1","pages":"64"},"PeriodicalIF":5.3000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462328/pdf/","citationCount":"0","resultStr":"{\"title\":\"A large language model-based tool for identifying relationships to industry in research on the carcinogenicity of benzene, cobalt, and aspartame.\",\"authors\":\"Nathan L DeBono, Vanessa Amar, Hardy Hardy, Mary K Schubauer-Berigan, Derek Ruths, Nicholas B King\",\"doi\":\"10.1186/s12940-025-01223-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Industry-funded research poses a threat to the validity of scientific inference on carcinogenic hazards. Scientists require tools to better identify and characterize industry sponsored research across bodies of evidence to reduce the possible influence of industry bias in evidence synthesis reviews. We applied a novel large language model (LLM)-based tool named InfluenceMapper to demonstrate and evaluate its performance in identifying relationships to industry in research on the carcinogenicity of benzene, cobalt, and aspartame.</p><p><strong>Methods: </strong>All epidemiological, animal cancer, and mechanistic studies included in systematic reviews on the carcinogenicity of the three agents by the IARC Monographs programme. Selected agents were recently evaluated by the Monographs and are of commercial interest by major industries. InfluenceMapper extracted disclosed entities in study publications and classified up to 40 possible disclosed relationship types between each entity and the study and between each entity and author. A human classified entities as 'industry or industry-funded' and assessed relationships with industry for potential conflicts of interest. Positive predictive values described the extent of true positive relationships identified by InfluenceMapper compared to human assessment.</p><p><strong>Results: </strong>Analyses included 2,046 studies for all three agents. We identified 320 disclosed industry or industry-funded entities from InfluenceMapper output that were involved in 770 distinct study-entity and author-entity relationships. For each agent, between 4 and 8% of studies disclosed funding by industry and 1-4% of studies had at least one author who disclosed receiving industry funding directly. Industry trade associations for all three agents funded 22 studies published in 16 journals over a 37-year span. Aside from funding, the most prevalent disclosed relationships with industry were receiving data, holding employment, paid consulting, and providing expert testimony. Positive predictive values were excellent (≥ 98%) for study-entity relationships but declined for relationships with individual authors.</p><p><strong>Conclusions: </strong>LLM-based tools can significantly expedite and bolster the detection of disclosed conflicts of interest from industry sponsored research in cancer prevention. Possible use cases include facilitating the assessment of bias from industry studies in evidence synthesis reviews and alerting scientists to the influence of industry on scientific inference. Persistent challenges in ascertaining conflicts of interest underscore the urgent need for standardized, transparent, and enforceable disclosures in biomedical journals.</p>\",\"PeriodicalId\":11686,\"journal\":{\"name\":\"Environmental Health\",\"volume\":\"24 1\",\"pages\":\"64\"},\"PeriodicalIF\":5.3000,\"publicationDate\":\"2025-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12462328/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Health\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1186/s12940-025-01223-1\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Health","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1186/s12940-025-01223-1","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

摘要

背景:工业资助的研究对致癌危害的科学推断的有效性构成威胁。科学家需要工具来更好地识别和描述行业赞助的跨证据体研究,以减少行业偏见在证据综合审查中的可能影响。在苯、钴和阿斯巴甜的致癌性研究中,我们应用了一种名为InfluenceMapper的新型大型语言模型(LLM)为基础的工具来展示和评估其在识别与工业关系方面的表现。方法:国际癌症研究机构专著项目对这三种药物致癌性的系统评价中包括的所有流行病学、动物癌症和机制研究。选定的药剂最近由专著进行了评估,并且具有主要行业的商业利益。InfluenceMapper提取了研究出版物中披露的实体,并对每个实体与研究之间以及每个实体与作者之间可能披露的关系类型进行了多达40种的分类。人类将实体分类为“行业或行业资助”,并评估与行业的关系是否存在潜在的利益冲突。积极预测值描述了InfluenceMapper与人类评估相比确定的真正积极关系的程度。结果:分析包括所有三种药物的2046项研究。我们从InfluenceMapper的输出中确定了320个披露的行业或行业资助实体,它们涉及770种不同的研究实体和作者实体关系。对于每个机构,4% - 8%的研究披露了行业资助,1% -4%的研究至少有一位作者披露直接接受行业资助。这三种药物的行业协会在37年的时间里资助了22项研究,发表在16份期刊上。除了资金,最普遍的披露与行业的关系是接收数据、雇佣、有偿咨询和提供专家证词。研究实体关系的阳性预测值极好(≥98%),但与个体作者关系的阳性预测值下降。结论:基于法学硕士的工具可以显著加快和加强对行业赞助的癌症预防研究中披露的利益冲突的检测。可能的用例包括促进在证据综合审查中评估来自行业研究的偏见,并提醒科学家注意行业对科学推断的影响。在确定利益冲突方面的持续挑战强调了在生物医学期刊中进行标准化、透明和可执行的披露的迫切需要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A large language model-based tool for identifying relationships to industry in research on the carcinogenicity of benzene, cobalt, and aspartame.

Background: Industry-funded research poses a threat to the validity of scientific inference on carcinogenic hazards. Scientists require tools to better identify and characterize industry sponsored research across bodies of evidence to reduce the possible influence of industry bias in evidence synthesis reviews. We applied a novel large language model (LLM)-based tool named InfluenceMapper to demonstrate and evaluate its performance in identifying relationships to industry in research on the carcinogenicity of benzene, cobalt, and aspartame.

Methods: All epidemiological, animal cancer, and mechanistic studies included in systematic reviews on the carcinogenicity of the three agents by the IARC Monographs programme. Selected agents were recently evaluated by the Monographs and are of commercial interest by major industries. InfluenceMapper extracted disclosed entities in study publications and classified up to 40 possible disclosed relationship types between each entity and the study and between each entity and author. A human classified entities as 'industry or industry-funded' and assessed relationships with industry for potential conflicts of interest. Positive predictive values described the extent of true positive relationships identified by InfluenceMapper compared to human assessment.

Results: Analyses included 2,046 studies for all three agents. We identified 320 disclosed industry or industry-funded entities from InfluenceMapper output that were involved in 770 distinct study-entity and author-entity relationships. For each agent, between 4 and 8% of studies disclosed funding by industry and 1-4% of studies had at least one author who disclosed receiving industry funding directly. Industry trade associations for all three agents funded 22 studies published in 16 journals over a 37-year span. Aside from funding, the most prevalent disclosed relationships with industry were receiving data, holding employment, paid consulting, and providing expert testimony. Positive predictive values were excellent (≥ 98%) for study-entity relationships but declined for relationships with individual authors.

Conclusions: LLM-based tools can significantly expedite and bolster the detection of disclosed conflicts of interest from industry sponsored research in cancer prevention. Possible use cases include facilitating the assessment of bias from industry studies in evidence synthesis reviews and alerting scientists to the influence of industry on scientific inference. Persistent challenges in ascertaining conflicts of interest underscore the urgent need for standardized, transparent, and enforceable disclosures in biomedical journals.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environmental Health
Environmental Health 环境科学-公共卫生、环境卫生与职业卫生
CiteScore
10.10
自引率
1.70%
发文量
115
审稿时长
3.0 months
期刊介绍: Environmental Health publishes manuscripts on all aspects of environmental and occupational medicine and related studies in toxicology and epidemiology. Environmental Health is aimed at scientists and practitioners in all areas of environmental science where human health and well-being are involved, either directly or indirectly. Environmental Health is a public health journal serving the public health community and scientists working on matters of public health interest and importance pertaining to the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信