Thalita Cirino, Luis Pinto, Mateusz Iwan, Alexis Dougha, Bono Lučić, Antonija Kraljević, Zaven Navoyan, Ani Tevosyan, Hrach Yeghiazaryan, Lusine Khondkaryan, Narek Abelyan, Vahe Atoyan, Nelly Babayan, Yuma Iwashita, Kyosuke Kimura, Tomoya Komasaka, Koki Shishido, Taichi Nakamura, Mizuho Asada, Sankalp Jain, Alexey V Zakharov, Haobo Wang, Wenjia Liu, Vladimir Chupakhin, Yoshihiro Uesawa
{"title":"Consensus Modeling Strategies for Predicting Transthyretin Binding Affinity from Tox24 Challenge Data.","authors":"Thalita Cirino, Luis Pinto, Mateusz Iwan, Alexis Dougha, Bono Lučić, Antonija Kraljević, Zaven Navoyan, Ani Tevosyan, Hrach Yeghiazaryan, Lusine Khondkaryan, Narek Abelyan, Vahe Atoyan, Nelly Babayan, Yuma Iwashita, Kyosuke Kimura, Tomoya Komasaka, Koki Shishido, Taichi Nakamura, Mizuho Asada, Sankalp Jain, Alexey V Zakharov, Haobo Wang, Wenjia Liu, Vladimir Chupakhin, Yoshihiro Uesawa","doi":"10.1021/acs.chemrestox.5c00018","DOIUrl":null,"url":null,"abstract":"<p><p>Transthyretin (TTR) is a key transporter of the thyroid hormone thyroxine, and chemicals that bind to TTR, displacing the hormone, can disrupt the endocrine system, even at low concentrations. This study evaluates computational modeling strategies developed during the Tox24 Challenge, using a data set of 1512 compounds tested for TTR binding affinity. Individual models from nine top-performing teams were analyzed for performance and uncertainty using regression metrics and applicability domains (AD). Consensus models were developed by averaging predictions across these models, with and without consideration of their ADs. While applying AD constraints in individual models generally improved external prediction accuracy (at the expense of reduced chemical space coverage), it had limited additional benefit for consensus models. Results showed that consensus models outperformed individual models, achieving a root-mean-square error (RMSE) of 19.8% on the test set, compared to an average RMSE of 20.9% for the nine individual models. Outliers consistently identified in several of these models indicate potential experimental artifacts and/or activity cliffs, requiring further investigation. Substructure importance analysis revealed that models prioritized different chemical features, and consensus averaging harmonized these divergent perspectives. These findings highlight the value of consensus modeling in improving predictive performance and addressing model limitations. Future work should focus on expanding chemical space coverage and refining experimental data sets to support public health protection.</p>","PeriodicalId":31,"journal":{"name":"Chemical Research in Toxicology","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-05-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Research in Toxicology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1021/acs.chemrestox.5c00018","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Transthyretin (TTR) is a key transporter of the thyroid hormone thyroxine, and chemicals that bind to TTR, displacing the hormone, can disrupt the endocrine system, even at low concentrations. This study evaluates computational modeling strategies developed during the Tox24 Challenge, using a data set of 1512 compounds tested for TTR binding affinity. Individual models from nine top-performing teams were analyzed for performance and uncertainty using regression metrics and applicability domains (AD). Consensus models were developed by averaging predictions across these models, with and without consideration of their ADs. While applying AD constraints in individual models generally improved external prediction accuracy (at the expense of reduced chemical space coverage), it had limited additional benefit for consensus models. Results showed that consensus models outperformed individual models, achieving a root-mean-square error (RMSE) of 19.8% on the test set, compared to an average RMSE of 20.9% for the nine individual models. Outliers consistently identified in several of these models indicate potential experimental artifacts and/or activity cliffs, requiring further investigation. Substructure importance analysis revealed that models prioritized different chemical features, and consensus averaging harmonized these divergent perspectives. These findings highlight the value of consensus modeling in improving predictive performance and addressing model limitations. Future work should focus on expanding chemical space coverage and refining experimental data sets to support public health protection.
期刊介绍:
Chemical Research in Toxicology publishes Articles, Rapid Reports, Chemical Profiles, Reviews, Perspectives, Letters to the Editor, and ToxWatch on a wide range of topics in Toxicology that inform a chemical and molecular understanding and capacity to predict biological outcomes on the basis of structures and processes. The overarching goal of activities reported in the Journal are to provide knowledge and innovative approaches needed to promote intelligent solutions for human safety and ecosystem preservation. The journal emphasizes insight concerning mechanisms of toxicity over phenomenological observations. It upholds rigorous chemical, physical and mathematical standards for characterization and application of modern techniques.