AI-based toxicity prediction models using ToxCast data: Current status and future directions for explainable models

IF 4.8 3区 医学 Q1 PHARMACOLOGY & PHARMACY
Donghyeon Kim , Jinhee Choi
{"title":"AI-based toxicity prediction models using ToxCast data: Current status and future directions for explainable models","authors":"Donghyeon Kim ,&nbsp;Jinhee Choi","doi":"10.1016/j.tox.2025.154230","DOIUrl":null,"url":null,"abstract":"<div><div>Artificial intelligence (AI) offers new opportunities for developing toxicity prediction models to screen environmental chemicals. U.S. EPA’s ToxCast program provides one of the largest toxicological databases and has consequently become the most widely used data source for developing AI-driven models. ToxCast In this review, we analyzed 93 peer-reviewed papers published since 2015 to provide an overview of ToxCast data-based AI models. We overviewed the current landscape in terms of database structure, target endpoints, molecular representations, and learning algorithms. Most models focus on data-rich endpoints and organ-specific toxicity mechanisms, particularly endocrine disruption and hepatotoxicity. While conventional molecular fingerprints and descriptors are still common, recent studies employ alternative representations—graphs, images, and text—leveraging advances in deep learning. Likewise, traditional supervised machine-learning algorithms remain prevalent, but newer work increasingly adopts semi- and unsupervised approaches to tackle data-sparsity challenges. Beyond classical structure-based QSAR, ToxCast data are also being used as biological features to predict in vivo toxicity. We conclude by discussing current limitations and future directions for applying ToxCast-based AI models to accelerate next-generation risk assessment (NGRA).</div></div>","PeriodicalId":23159,"journal":{"name":"Toxicology","volume":"517 ","pages":"Article 154230"},"PeriodicalIF":4.8000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Toxicology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0300483X25001891","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PHARMACOLOGY & PHARMACY","Score":null,"Total":0}
引用次数: 0

Abstract

Artificial intelligence (AI) offers new opportunities for developing toxicity prediction models to screen environmental chemicals. U.S. EPA’s ToxCast program provides one of the largest toxicological databases and has consequently become the most widely used data source for developing AI-driven models. ToxCast In this review, we analyzed 93 peer-reviewed papers published since 2015 to provide an overview of ToxCast data-based AI models. We overviewed the current landscape in terms of database structure, target endpoints, molecular representations, and learning algorithms. Most models focus on data-rich endpoints and organ-specific toxicity mechanisms, particularly endocrine disruption and hepatotoxicity. While conventional molecular fingerprints and descriptors are still common, recent studies employ alternative representations—graphs, images, and text—leveraging advances in deep learning. Likewise, traditional supervised machine-learning algorithms remain prevalent, but newer work increasingly adopts semi- and unsupervised approaches to tackle data-sparsity challenges. Beyond classical structure-based QSAR, ToxCast data are also being used as biological features to predict in vivo toxicity. We conclude by discussing current limitations and future directions for applying ToxCast-based AI models to accelerate next-generation risk assessment (NGRA).
使用ToxCast数据的基于ai的毒性预测模型:可解释模型的现状和未来方向。
人工智能(AI)为开发毒性预测模型以筛选环境化学物质提供了新的机会。美国环保署的ToxCast项目提供了最大的毒理学数据库之一,因此成为开发人工智能驱动模型最广泛使用的数据源。在这篇综述中,我们分析了自2015年以来发表的93篇同行评议论文,概述了基于ToxCast数据的人工智能模型。我们从数据库结构、目标端点、分子表示和学习算法等方面概述了当前的情况。大多数模型侧重于数据丰富的端点和器官特异性毒性机制,特别是内分泌干扰和肝毒性。虽然传统的分子指纹和描述符仍然很常见,但最近的研究利用深度学习的进步,采用了图形、图像和文本等替代表示。同样,传统的监督式机器学习算法仍然普遍存在,但新的工作越来越多地采用半监督和无监督的方法来解决数据稀疏性挑战。除了经典的基于结构的QSAR, ToxCast数据也被用作预测体内毒性的生物学特征。最后,我们讨论了应用基于toxcast的人工智能模型加速下一代风险评估(NGRA)的当前局限性和未来方向。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Toxicology
Toxicology 医学-毒理学
CiteScore
7.80
自引率
4.40%
发文量
222
审稿时长
23 days
期刊介绍: Toxicology is an international, peer-reviewed journal that publishes only the highest quality original scientific research and critical reviews describing hypothesis-based investigations into mechanisms of toxicity associated with exposures to xenobiotic chemicals, particularly as it relates to human health. In this respect "mechanisms" is defined on both the macro (e.g. physiological, biological, kinetic, species, sex, etc.) and molecular (genomic, transcriptomic, metabolic, etc.) scale. Emphasis is placed on findings that identify novel hazards and that can be extrapolated to exposures and mechanisms that are relevant to estimating human risk. Toxicology also publishes brief communications, personal commentaries and opinion articles, as well as concise expert reviews on contemporary topics. All research and review articles published in Toxicology are subject to rigorous peer review. Authors are asked to contact the Editor-in-Chief prior to submitting review articles or commentaries for consideration for publication in Toxicology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信