Brett Hagan, Louis Groff, Grace Patlewicz and Imran Shah*,
{"title":"迈向跨读中的代谢相似性:使用图卷积网络预测模拟代谢网络的遗传毒性结果的案例研究","authors":"Brett Hagan, Louis Groff, Grace Patlewicz and Imran Shah*, ","doi":"10.1021/acs.chemrestox.5c0012010.1021/acs.chemrestox.5c00120","DOIUrl":null,"url":null,"abstract":"<p >Metabolic similarity is a key consideration in evaluating candidate source analogues for read-across (RAx), but approaches to systematically characterize metabolism for read-across prediction are still evolving. Metabolic similarity is multifaceted, considering the similarity of the metabolic tree, the metabolites simulated, and the transformation pathways. The structure of metabolic trees lends itself naturally to graph representations, for which several methods, including graph convolutional networks (GCNs), can be applied to quantify the pairwise similarity between the target and source analogue(s) within an analogue or category approach. In this study, we compared metabolic graph representations of metabolites with structural similarities in predicting genotoxicity outcomes using a data set comprising 5403 chemicals. Xenobiotic metabolism pathways were predicted using the rat liver models within the commercial expert system, TIssue MEtabolism Simulator (TIMES), and the phase I and II xenobiotic metabolism modules within the freely available system BioTransformer. Metabolic pathways were converted to graphs and used to train GCNs, generating embeddings for each chemical. The classification performance of generalized read-across (GenRA), random forest (RF), logistic regression (LR), and multilayer perceptron (MLP) was compared using GCN-derived embeddings versus both Morgan and MACCS chemical fingerprints to identify genotoxic chemicals. GCN embeddings with LR, based on in vivo TIMES metabolism predictions using MACCS fingerprints as node features, achieved the highest area under the curve of the receiver operating characteristic of 0.807, outperforming GenRA and LR with MACCS fingerprints by 14.47% and 5.49%, respectively. Our findings suggest that GCN embeddings of predicted metabolism pathways perform substantially better than structural features of the parent chemicals in predicting genotoxicity outcomes. Such GCN embeddings offer new avenues of systematically encoding end point metabolic information to facilitate analogue identification for read-across.</p>","PeriodicalId":31,"journal":{"name":"Chemical Research in Toxicology","volume":"38 6","pages":"1122–1133 1122–1133"},"PeriodicalIF":3.8000,"publicationDate":"2025-05-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Toward Metabolic Similarity in Read-Across: A Case Study Using Graph Convolutional Networks to Predict Genotoxicity Outcomes from Simulated Metabolic Networks\",\"authors\":\"Brett Hagan, Louis Groff, Grace Patlewicz and Imran Shah*, \",\"doi\":\"10.1021/acs.chemrestox.5c0012010.1021/acs.chemrestox.5c00120\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Metabolic similarity is a key consideration in evaluating candidate source analogues for read-across (RAx), but approaches to systematically characterize metabolism for read-across prediction are still evolving. Metabolic similarity is multifaceted, considering the similarity of the metabolic tree, the metabolites simulated, and the transformation pathways. The structure of metabolic trees lends itself naturally to graph representations, for which several methods, including graph convolutional networks (GCNs), can be applied to quantify the pairwise similarity between the target and source analogue(s) within an analogue or category approach. In this study, we compared metabolic graph representations of metabolites with structural similarities in predicting genotoxicity outcomes using a data set comprising 5403 chemicals. Xenobiotic metabolism pathways were predicted using the rat liver models within the commercial expert system, TIssue MEtabolism Simulator (TIMES), and the phase I and II xenobiotic metabolism modules within the freely available system BioTransformer. Metabolic pathways were converted to graphs and used to train GCNs, generating embeddings for each chemical. The classification performance of generalized read-across (GenRA), random forest (RF), logistic regression (LR), and multilayer perceptron (MLP) was compared using GCN-derived embeddings versus both Morgan and MACCS chemical fingerprints to identify genotoxic chemicals. GCN embeddings with LR, based on in vivo TIMES metabolism predictions using MACCS fingerprints as node features, achieved the highest area under the curve of the receiver operating characteristic of 0.807, outperforming GenRA and LR with MACCS fingerprints by 14.47% and 5.49%, respectively. Our findings suggest that GCN embeddings of predicted metabolism pathways perform substantially better than structural features of the parent chemicals in predicting genotoxicity outcomes. Such GCN embeddings offer new avenues of systematically encoding end point metabolic information to facilitate analogue identification for read-across.</p>\",\"PeriodicalId\":31,\"journal\":{\"name\":\"Chemical Research in Toxicology\",\"volume\":\"38 6\",\"pages\":\"1122–1133 1122–1133\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-05-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Chemical Research in Toxicology\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/acs.chemrestox.5c00120\",\"RegionNum\":3,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Chemical Research in Toxicology","FirstCategoryId":"3","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.chemrestox.5c00120","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
Toward Metabolic Similarity in Read-Across: A Case Study Using Graph Convolutional Networks to Predict Genotoxicity Outcomes from Simulated Metabolic Networks
Metabolic similarity is a key consideration in evaluating candidate source analogues for read-across (RAx), but approaches to systematically characterize metabolism for read-across prediction are still evolving. Metabolic similarity is multifaceted, considering the similarity of the metabolic tree, the metabolites simulated, and the transformation pathways. The structure of metabolic trees lends itself naturally to graph representations, for which several methods, including graph convolutional networks (GCNs), can be applied to quantify the pairwise similarity between the target and source analogue(s) within an analogue or category approach. In this study, we compared metabolic graph representations of metabolites with structural similarities in predicting genotoxicity outcomes using a data set comprising 5403 chemicals. Xenobiotic metabolism pathways were predicted using the rat liver models within the commercial expert system, TIssue MEtabolism Simulator (TIMES), and the phase I and II xenobiotic metabolism modules within the freely available system BioTransformer. Metabolic pathways were converted to graphs and used to train GCNs, generating embeddings for each chemical. The classification performance of generalized read-across (GenRA), random forest (RF), logistic regression (LR), and multilayer perceptron (MLP) was compared using GCN-derived embeddings versus both Morgan and MACCS chemical fingerprints to identify genotoxic chemicals. GCN embeddings with LR, based on in vivo TIMES metabolism predictions using MACCS fingerprints as node features, achieved the highest area under the curve of the receiver operating characteristic of 0.807, outperforming GenRA and LR with MACCS fingerprints by 14.47% and 5.49%, respectively. Our findings suggest that GCN embeddings of predicted metabolism pathways perform substantially better than structural features of the parent chemicals in predicting genotoxicity outcomes. Such GCN embeddings offer new avenues of systematically encoding end point metabolic information to facilitate analogue identification for read-across.
期刊介绍:
Chemical Research in Toxicology publishes Articles, Rapid Reports, Chemical Profiles, Reviews, Perspectives, Letters to the Editor, and ToxWatch on a wide range of topics in Toxicology that inform a chemical and molecular understanding and capacity to predict biological outcomes on the basis of structures and processes. The overarching goal of activities reported in the Journal are to provide knowledge and innovative approaches needed to promote intelligent solutions for human safety and ecosystem preservation. The journal emphasizes insight concerning mechanisms of toxicity over phenomenological observations. It upholds rigorous chemical, physical and mathematical standards for characterization and application of modern techniques.