A deep-learning approach to predict reproductive toxicity of chemicals using communicative message passing neural network.

IF 4.6 Q2 TOXICOLOGY
Frontiers in toxicology Pub Date : 2025-07-22 eCollection Date: 2025-01-01 DOI:10.3389/ftox.2025.1640612
Owen He, Daoxing Chen, Yimei Li
{"title":"A deep-learning approach to predict reproductive toxicity of chemicals using communicative message passing neural network.","authors":"Owen He, Daoxing Chen, Yimei Li","doi":"10.3389/ftox.2025.1640612","DOIUrl":null,"url":null,"abstract":"<p><p>Reproductive toxicity is a concern critical to human health and chemical safety assessment. Recently, the U.S. Food and Drug Administration announced plans to assess toxicity with artificial intelligence-based computational models instead of animal studies in \"a win-win for public health and ethics.\" In this study, we used a reproductive toxicity dataset using Simplified Molecular Input Line Entry Specifications (SMILES) to represent 1091 reproductively toxic and 1063 non-toxic small-molecule compounds. A repeated nested cross-validation procedure was applied, in which the dataset was randomly partitioned into five distinct folds in the outer loop, each time, one fold serving as the test set. In the inner loop, a similar procedure was also repeated five times, with 12.5% each time serving as the validation set. We first evaluated the performance of classical machine learning (ML) methods such as Random Forest and Extreme Gradient Boosting on predicting reproductive toxicity, using standard model evaluation metrics including accuracy score (ACC), the area under the curve (AUC) of the receiver operating characteristics curve (ROC) and F1 score. Our analyses indicate that these methods' overall results were mediocre and insufficient for high-throughput screening. To overcome these limitations, we adopted the Communicative Message Passing Neural Network (CMPNN) framework, which incorporates a communicative kernel and a message booster module. Our results show that our ReproTox-CMPNN model outperforms the current best baselines in both embedding quality and predictive accuracy. In independent test sets, ReproTox-CMPNN achieved a mean AUC of 0.946, ACC of 0.857 and F1 score of 0.846, surpassing traditional algorithms to establish itself as a new state-of-the-art model in this field. These findings demonstrate that CMPNN's deep capture of multi-level molecular relationships offers an efficient and reliable computational tool for rapid chemical safety screening and risk assessment.</p>","PeriodicalId":73111,"journal":{"name":"Frontiers in toxicology","volume":"7 ","pages":"1640612"},"PeriodicalIF":4.6000,"publicationDate":"2025-07-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12321834/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in toxicology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3389/ftox.2025.1640612","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Reproductive toxicity is a concern critical to human health and chemical safety assessment. Recently, the U.S. Food and Drug Administration announced plans to assess toxicity with artificial intelligence-based computational models instead of animal studies in "a win-win for public health and ethics." In this study, we used a reproductive toxicity dataset using Simplified Molecular Input Line Entry Specifications (SMILES) to represent 1091 reproductively toxic and 1063 non-toxic small-molecule compounds. A repeated nested cross-validation procedure was applied, in which the dataset was randomly partitioned into five distinct folds in the outer loop, each time, one fold serving as the test set. In the inner loop, a similar procedure was also repeated five times, with 12.5% each time serving as the validation set. We first evaluated the performance of classical machine learning (ML) methods such as Random Forest and Extreme Gradient Boosting on predicting reproductive toxicity, using standard model evaluation metrics including accuracy score (ACC), the area under the curve (AUC) of the receiver operating characteristics curve (ROC) and F1 score. Our analyses indicate that these methods' overall results were mediocre and insufficient for high-throughput screening. To overcome these limitations, we adopted the Communicative Message Passing Neural Network (CMPNN) framework, which incorporates a communicative kernel and a message booster module. Our results show that our ReproTox-CMPNN model outperforms the current best baselines in both embedding quality and predictive accuracy. In independent test sets, ReproTox-CMPNN achieved a mean AUC of 0.946, ACC of 0.857 and F1 score of 0.846, surpassing traditional algorithms to establish itself as a new state-of-the-art model in this field. These findings demonstrate that CMPNN's deep capture of multi-level molecular relationships offers an efficient and reliable computational tool for rapid chemical safety screening and risk assessment.

Abstract Image

Abstract Image

Abstract Image

一种利用交流信息传递神经网络预测化学品生殖毒性的深度学习方法。
生殖毒性是一个对人类健康和化学品安全评估至关重要的问题。最近,美国食品和药物管理局宣布计划用基于人工智能的计算模型来评估毒性,而不是动物研究,这是“公共卫生和道德的双赢”。在这项研究中,我们使用了一个使用简化分子输入线输入规范(SMILES)的生殖毒性数据集来代表1091种生殖毒性和1063种无毒的小分子化合物。应用重复嵌套交叉验证程序,其中数据集在外部循环中随机划分为五个不同的折叠,每次折叠一个作为测试集。在内部循环中,类似的过程也重复了五次,每次使用12.5%作为验证集。我们首先使用标准模型评估指标,包括准确性评分(ACC)、受试者工作特征曲线(ROC)曲线下面积(AUC)和F1评分,评估了随机森林和极端梯度增强等经典机器学习(ML)方法在预测生殖毒性方面的性能。我们的分析表明,这些方法的总体结果一般,不足以进行高通量筛选。为了克服这些限制,我们采用了通信消息传递神经网络(CMPNN)框架,该框架包含一个通信内核和一个消息增强模块。我们的结果表明,我们的ReproTox-CMPNN模型在嵌入质量和预测精度方面都优于目前最好的基线。在独立测试集中,ReproTox-CMPNN的平均AUC为0.946,ACC为0.857,F1得分为0.846,超越了传统算法,成为该领域最先进的新模型。这些发现表明,CMPNN对多层次分子关系的深入捕捉为快速化学品安全筛选和风险评估提供了一种高效可靠的计算工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.80
自引率
0.00%
发文量
0
审稿时长
13 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信