Consensus Modeling for Predicting Chemical Binding to Transthyretin as the Winning Solution of the Tox24 Challenge.

IF 3.7 3区 医学 Q2 CHEMISTRY, MEDICINAL
Chemical Research in Toxicology Pub Date : 2025-03-17 Epub Date: 2025-02-19 DOI:10.1021/acs.chemrestox.4c00421
Dmitriy M Makarov, Alexander A Ksenofontov, Yury A Budkov
{"title":"Consensus Modeling for Predicting Chemical Binding to Transthyretin as the Winning Solution of the Tox24 Challenge.","authors":"Dmitriy M Makarov, Alexander A Ksenofontov, Yury A Budkov","doi":"10.1021/acs.chemrestox.4c00421","DOIUrl":null,"url":null,"abstract":"<p><p>The utilization of predictive methodologies for the assessment of toxicological properties represents an alternative approach that facilitates the identification of safe compounds while concurrently reducing the financial costs associated with the process. The objective of the Tox24 Challenge was to assess the progress in computational methods for predicting the activity of chemical binding to transthyretin (TTR). In order to fulfill the requirements of this task, the data set, measured by the Environmental Protection Agency, consisted of 1512 chemical substances of diverse nature. This paper describes the model that won the Tox24 Challenge and the steps taken for its further improvement. The Transformer convolutional neural network (CNN) model achieved the best performance as a standalone solution. Meanwhile, a multitask model built on a graph CNN, trained using 11 additional acute systemic toxicity data sets with increased weighting on the TTR binding activity, showed comparable results on the blind test set. The winning solution was a consensus model consisting of two catBoost models with OEstate and Mold2 descriptor sets, as well as two transformer-based models. The improvement of this solution involved adding a fifth model based on multitask learning using the graph CNN method, which led to a reduction in RMSE on the blind test set to 20.3%. The winning model was developed using the OCHEM web platform and is available online at https://ochem.eu/article/162082.</p>","PeriodicalId":31,"journal":{"name":"Chemical Research in Toxicology","volume":" ","pages":"392-399"},"PeriodicalIF":3.7000,"publicationDate":"2025-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Research in Toxicology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.chemrestox.4c00421","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/2/19 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

The utilization of predictive methodologies for the assessment of toxicological properties represents an alternative approach that facilitates the identification of safe compounds while concurrently reducing the financial costs associated with the process. The objective of the Tox24 Challenge was to assess the progress in computational methods for predicting the activity of chemical binding to transthyretin (TTR). In order to fulfill the requirements of this task, the data set, measured by the Environmental Protection Agency, consisted of 1512 chemical substances of diverse nature. This paper describes the model that won the Tox24 Challenge and the steps taken for its further improvement. The Transformer convolutional neural network (CNN) model achieved the best performance as a standalone solution. Meanwhile, a multitask model built on a graph CNN, trained using 11 additional acute systemic toxicity data sets with increased weighting on the TTR binding activity, showed comparable results on the blind test set. The winning solution was a consensus model consisting of two catBoost models with OEstate and Mold2 descriptor sets, as well as two transformer-based models. The improvement of this solution involved adding a fifth model based on multitask learning using the graph CNN method, which led to a reduction in RMSE on the blind test set to 20.3%. The winning model was developed using the OCHEM web platform and is available online at https://ochem.eu/article/162082.

预测转甲状腺素化学结合的共识模型是Tox24挑战的制胜方案。
利用预测方法评估毒理学特性是一种替代方法,它有助于确定安全化合物,同时减少与该过程相关的财务成本。Tox24挑战赛的目的是评估用于预测甲状腺转甲状腺素(TTR)化学结合活性的计算方法的进展。为了完成这项任务的要求,环境保护局测量的数据集包括1512种不同性质的化学物质。本文介绍了赢得Tox24挑战赛的模型以及进一步改进的步骤。Transformer卷积神经网络(CNN)模型作为独立解决方案获得了最佳性能。同时,建立在CNN图上的多任务模型,使用11个额外的急性全身毒性数据集进行训练,增加了TTR结合活性的权重,在盲测集上显示出类似的结果。获胜的解决方案是一个共识模型,该模型由两个catBoost模型(带有OEstate和Mold2描述符集)以及两个基于变压器的模型组成。该解决方案的改进涉及使用图CNN方法添加基于多任务学习的第五个模型,这导致盲测试集的RMSE降低到20.3%。获奖模型是使用OCHEM网络平台开发的,可在https://ochem.eu/article/162082上在线获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.90
自引率
7.30%
发文量
215
审稿时长
3.5 months
期刊介绍: Chemical Research in Toxicology publishes Articles, Rapid Reports, Chemical Profiles, Reviews, Perspectives, Letters to the Editor, and ToxWatch on a wide range of topics in Toxicology that inform a chemical and molecular understanding and capacity to predict biological outcomes on the basis of structures and processes. The overarching goal of activities reported in the Journal are to provide knowledge and innovative approaches needed to promote intelligent solutions for human safety and ecosystem preservation. The journal emphasizes insight concerning mechanisms of toxicity over phenomenological observations. It upholds rigorous chemical, physical and mathematical standards for characterization and application of modern techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信