迈向跨读中的代谢相似性：使用图卷积网络预测模拟代谢网络的遗传毒性结果的案例研究

IF 3.8 3区医学 Q2 CHEMISTRY, MEDICINAL

Chemical Research in Toxicology Pub Date : 2025-05-27 DOI:10.1021/acs.chemrestox.5c0012010.1021/acs.chemrestox.5c00120

Brett Hagan, Louis Groff, Grace Patlewicz and Imran Shah*,

{"title":"迈向跨读中的代谢相似性：使用图卷积网络预测模拟代谢网络的遗传毒性结果的案例研究","authors":"Brett Hagan, Louis Groff, Grace Patlewicz and Imran Shah*, ","doi":"10.1021/acs.chemrestox.5c0012010.1021/acs.chemrestox.5c00120","DOIUrl":null,"url":null,"abstract":"<p >Metabolic similarity is a key consideration in evaluating candidate source analogues for read-across (RAx), but approaches to systematically characterize metabolism for read-across prediction are still evolving. Metabolic similarity is multifaceted, considering the similarity of the metabolic tree, the metabolites simulated, and the transformation pathways. The structure of metabolic trees lends itself naturally to graph representations, for which several methods, including graph convolutional networks (GCNs), can be applied to quantify the pairwise similarity between the target and source analogue(s) within an analogue or category approach. In this study, we compared metabolic graph representations of metabolites with structural similarities in predicting genotoxicity outcomes using a data set comprising 5403 chemicals. Xenobiotic metabolism pathways were predicted using the rat liver models within the commercial expert system, TIssue MEtabolism Simulator (TIMES), and the phase I and II xenobiotic metabolism modules within the freely available system BioTransformer. Metabolic pathways were converted to graphs and used to train GCNs, generating embeddings for each chemical. The classification performance of generalized read-across (GenRA), random forest (RF), logistic regression (LR), and multilayer perceptron (MLP) was compared using GCN-derived embeddings versus both Morgan and MACCS chemical fingerprints to identify genotoxic chemicals. GCN embeddings with LR, based on in vivo TIMES metabolism predictions using MACCS fingerprints as node features, achieved the highest area under the curve of the receiver operating characteristic of 0.807, outperforming GenRA and LR with MACCS fingerprints by 14.47% and 5.49%, respectively. Our findings suggest that GCN embeddings of predicted metabolism pathways perform substantially better than structural features of the parent chemicals in predicting genotoxicity outcomes. Such GCN embeddings offer new avenues of systematically encoding end point metabolic information to facilitate analogue identification for read-across.</p>","PeriodicalId":31,"journal":{"name":"Chemical Research in Toxicology","volume":"38 6","pages":"1122–1133 1122–1133"},"PeriodicalIF":3.8000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward Metabolic Similarity in Read-Across: A Case Study Using Graph Convolutional Networks to Predict Genotoxicity Outcomes from Simulated Metabolic Networks\",\"authors\":\"Brett Hagan, Louis Groff, Grace Patlewicz and Imran Shah*, \",\"doi\":\"10.1021/acs.chemrestox.5c0012010.1021/acs.chemrestox.5c00120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Metabolic similarity is a key consideration in evaluating candidate source analogues for read-across (RAx), but approaches to systematically characterize metabolism for read-across prediction are still evolving. Metabolic similarity is multifaceted, considering the similarity of the metabolic tree, the metabolites simulated, and the transformation pathways. The structure of metabolic trees lends itself naturally to graph representations, for which several methods, including graph convolutional networks (GCNs), can be applied to quantify the pairwise similarity between the target and source analogue(s) within an analogue or category approach. In this study, we compared metabolic graph representations of metabolites with structural similarities in predicting genotoxicity outcomes using a data set comprising 5403 chemicals. Xenobiotic metabolism pathways were predicted using the rat liver models within the commercial expert system, TIssue MEtabolism Simulator (TIMES), and the phase I and II xenobiotic metabolism modules within the freely available system BioTransformer. Metabolic pathways were converted to graphs and used to train GCNs, generating embeddings for each chemical. The classification performance of generalized read-across (GenRA), random forest (RF), logistic regression (LR), and multilayer perceptron (MLP) was compared using GCN-derived embeddings versus both Morgan and MACCS chemical fingerprints to identify genotoxic chemicals. GCN embeddings with LR, based on in vivo TIMES metabolism predictions using MACCS fingerprints as node features, achieved the highest area under the curve of the receiver operating characteristic of 0.807, outperforming GenRA and LR with MACCS fingerprints by 14.47% and 5.49%, respectively. Our findings suggest that GCN embeddings of predicted metabolism pathways perform substantially better than structural features of the parent chemicals in predicting genotoxicity outcomes. Such GCN embeddings offer new avenues of systematically encoding end point metabolic information to facilitate analogue identification for read-across.</p>\",\"PeriodicalId\":31,\"journal\":{\"name\":\"Chemical Research in Toxicology\",\"volume\":\"38 6\",\"pages\":\"1122–1133 1122–1133\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemical Research in Toxicology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.chemrestox.5c00120\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Research in Toxicology","FirstCategoryId":"3","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.chemrestox.5c00120","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}

引用次数: 0

摘要

代谢相似性是评估跨读（RAx）候选源类似物的关键考虑因素，但系统表征跨读预测代谢的方法仍在发展中。代谢相似性是多方面的，考虑到代谢树、模拟代谢物和转化途径的相似性。代谢树的结构自然地适合于图形表示，为此，包括图卷积网络（GCNs）在内的几种方法可以应用于在模拟或类别方法中量化目标和源模拟物之间的成对相似性。在这项研究中，我们使用包含5403种化学物质的数据集，比较了代谢物的代谢图表示和结构相似性，以预测遗传毒性结果。利用商业专家系统、组织代谢模拟器（TIMES）中的大鼠肝脏模型和免费系统BioTransformer中的I期和II期异种代谢模块预测异种代谢途径。代谢途径被转换成图形，用于训练GCNs，为每种化学物质生成嵌入。使用gcn衍生的嵌入与Morgan和MACCS化学指纹鉴别遗传毒性化学物质，比较了广义跨读（GenRA）、随机森林（RF）、逻辑回归（LR）和多层感知器（MLP）的分类性能。基于体内TIMES代谢预测，以MACCS指纹作为节点特征，结合LR的GCN嵌入获得的接收者工作特征曲线下面积最高，为0.807，分别优于采用MACCS指纹的GenRA和LR，分别高出14.47%和5.49%。我们的研究结果表明，预测代谢途径的GCN嵌入比母体化学物质的结构特征在预测遗传毒性结果方面表现得更好。这样的GCN嵌入提供了系统编码端点代谢信息的新途径，以促进跨读的模拟识别。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Toward Metabolic Similarity in Read-Across: A Case Study Using Graph Convolutional Networks to Predict Genotoxicity Outcomes from Simulated Metabolic Networks

查看原文本刊更多论文

Toward Metabolic Similarity in Read-Across: A Case Study Using Graph Convolutional Networks to Predict Genotoxicity Outcomes from Simulated Metabolic Networks

Metabolic similarity is a key consideration in evaluating candidate source analogues for read-across (RAx), but approaches to systematically characterize metabolism for read-across prediction are still evolving. Metabolic similarity is multifaceted, considering the similarity of the metabolic tree, the metabolites simulated, and the transformation pathways. The structure of metabolic trees lends itself naturally to graph representations, for which several methods, including graph convolutional networks (GCNs), can be applied to quantify the pairwise similarity between the target and source analogue(s) within an analogue or category approach. In this study, we compared metabolic graph representations of metabolites with structural similarities in predicting genotoxicity outcomes using a data set comprising 5403 chemicals. Xenobiotic metabolism pathways were predicted using the rat liver models within the commercial expert system, TIssue MEtabolism Simulator (TIMES), and the phase I and II xenobiotic metabolism modules within the freely available system BioTransformer. Metabolic pathways were converted to graphs and used to train GCNs, generating embeddings for each chemical. The classification performance of generalized read-across (GenRA), random forest (RF), logistic regression (LR), and multilayer perceptron (MLP) was compared using GCN-derived embeddings versus both Morgan and MACCS chemical fingerprints to identify genotoxic chemicals. GCN embeddings with LR, based on in vivo TIMES metabolism predictions using MACCS fingerprints as node features, achieved the highest area under the curve of the receiver operating characteristic of 0.807, outperforming GenRA and LR with MACCS fingerprints by 14.47% and 5.49%, respectively. Our findings suggest that GCN embeddings of predicted metabolism pathways perform substantially better than structural features of the parent chemicals in predicting genotoxicity outcomes. Such GCN embeddings offer new avenues of systematically encoding end point metabolic information to facilitate analogue identification for read-across.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Chemical Research in Toxicology 医学-毒理学

CiteScore

7.90

自引率

7.30%

发文量

215

审稿时长

3.5 months

期刊介绍： Chemical Research in Toxicology publishes Articles, Rapid Reports, Chemical Profiles, Reviews, Perspectives, Letters to the Editor, and ToxWatch on a wide range of topics in Toxicology that inform a chemical and molecular understanding and capacity to predict biological outcomes on the basis of structures and processes. The overarching goal of activities reported in the Journal are to provide knowledge and innovative approaches needed to promote intelligent solutions for human safety and ecosystem preservation. The journal emphasizes insight concerning mechanisms of toxicity over phenomenological observations. It upholds rigorous chemical, physical and mathematical standards for characterization and application of modern techniques.