Grace Patlewicz , Nathaniel Charest , Amanda Ross , HC Bledsoe , Janielle Vidal , Sadegh Faramarzi , Brett Hagan , Imran Shah
{"title":"构建一个专家驱动的跨读案例汇编,以方便分析不同相似上下文对跨读性能的贡献","authors":"Grace Patlewicz , Nathaniel Charest , Amanda Ross , HC Bledsoe , Janielle Vidal , Sadegh Faramarzi , Brett Hagan , Imran Shah","doi":"10.1016/j.comtox.2025.100366","DOIUrl":null,"url":null,"abstract":"<div><div>Read-across is a data-gap filling technique used to predict the toxicity of a target chemical based on data from similar analogues. It is predominantly performed through expert-driven assessments which can limit reproducibility and broader acceptance. Data-driven approaches such as Generalised Read-Across (GenRA) offer the potential to generate more reproducible read-across predictions with quantified uncertainties and performance metrics. A key challenge is reconciling expert- and data-driven approaches particularly in how analogues are identified, evaluated and used to derive predictions. A critical aspect of analogue selection lies in understanding the relative contribution of different similarity contexts e.g. whether structural similarity plays a larger role than metabolism similarity. This study explored these considerations by compiling a compendium of expert-driven read-across assessments for repeated dose toxicity endpoints from peer reviewed and grey literature. Pairwise similarity was quantified across structural, physicochemical, metabolic and reactivity features within each case and a prediction model was developed to evaluate the contribution of each similarity context in analogue selection. Although the dataset comprised 157 read-across cases and 695 unique substances, it was limited in size, heterogeneous in origin and variable in analogue selection criteria and use contexts. These factors constrain generalisability of the findings and indicate that conclusions should be interpreted with caution. Nonetheless, the qualitative insight that structure and metabolism were influential led to a followup investigation using graph-based deep learning to explore whether embeddings derived from structure and/or metabolism information could improve read-across predictions, using repeated dose toxicity as a case study, relative to structural similarity baselines.</div></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"35 ","pages":"Article 100366"},"PeriodicalIF":3.1000,"publicationDate":"2025-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Building a compendium of expert driven read-across cases to facilitate an analysis of the contribution that different similarity contexts play in read-across performance\",\"authors\":\"Grace Patlewicz , Nathaniel Charest , Amanda Ross , HC Bledsoe , Janielle Vidal , Sadegh Faramarzi , Brett Hagan , Imran Shah\",\"doi\":\"10.1016/j.comtox.2025.100366\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Read-across is a data-gap filling technique used to predict the toxicity of a target chemical based on data from similar analogues. It is predominantly performed through expert-driven assessments which can limit reproducibility and broader acceptance. Data-driven approaches such as Generalised Read-Across (GenRA) offer the potential to generate more reproducible read-across predictions with quantified uncertainties and performance metrics. A key challenge is reconciling expert- and data-driven approaches particularly in how analogues are identified, evaluated and used to derive predictions. A critical aspect of analogue selection lies in understanding the relative contribution of different similarity contexts e.g. whether structural similarity plays a larger role than metabolism similarity. This study explored these considerations by compiling a compendium of expert-driven read-across assessments for repeated dose toxicity endpoints from peer reviewed and grey literature. Pairwise similarity was quantified across structural, physicochemical, metabolic and reactivity features within each case and a prediction model was developed to evaluate the contribution of each similarity context in analogue selection. Although the dataset comprised 157 read-across cases and 695 unique substances, it was limited in size, heterogeneous in origin and variable in analogue selection criteria and use contexts. These factors constrain generalisability of the findings and indicate that conclusions should be interpreted with caution. Nonetheless, the qualitative insight that structure and metabolism were influential led to a followup investigation using graph-based deep learning to explore whether embeddings derived from structure and/or metabolism information could improve read-across predictions, using repeated dose toxicity as a case study, relative to structural similarity baselines.</div></div>\",\"PeriodicalId\":37651,\"journal\":{\"name\":\"Computational Toxicology\",\"volume\":\"35 \",\"pages\":\"Article 100366\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-07-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Toxicology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S246811132500026X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TOXICOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Toxicology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S246811132500026X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}
Building a compendium of expert driven read-across cases to facilitate an analysis of the contribution that different similarity contexts play in read-across performance
Read-across is a data-gap filling technique used to predict the toxicity of a target chemical based on data from similar analogues. It is predominantly performed through expert-driven assessments which can limit reproducibility and broader acceptance. Data-driven approaches such as Generalised Read-Across (GenRA) offer the potential to generate more reproducible read-across predictions with quantified uncertainties and performance metrics. A key challenge is reconciling expert- and data-driven approaches particularly in how analogues are identified, evaluated and used to derive predictions. A critical aspect of analogue selection lies in understanding the relative contribution of different similarity contexts e.g. whether structural similarity plays a larger role than metabolism similarity. This study explored these considerations by compiling a compendium of expert-driven read-across assessments for repeated dose toxicity endpoints from peer reviewed and grey literature. Pairwise similarity was quantified across structural, physicochemical, metabolic and reactivity features within each case and a prediction model was developed to evaluate the contribution of each similarity context in analogue selection. Although the dataset comprised 157 read-across cases and 695 unique substances, it was limited in size, heterogeneous in origin and variable in analogue selection criteria and use contexts. These factors constrain generalisability of the findings and indicate that conclusions should be interpreted with caution. Nonetheless, the qualitative insight that structure and metabolism were influential led to a followup investigation using graph-based deep learning to explore whether embeddings derived from structure and/or metabolism information could improve read-across predictions, using repeated dose toxicity as a case study, relative to structural similarity baselines.
期刊介绍:
Computational Toxicology is an international journal publishing computational approaches that assist in the toxicological evaluation of new and existing chemical substances assisting in their safety assessment. -All effects relating to human health and environmental toxicity and fate -Prediction of toxicity, metabolism, fate and physico-chemical properties -The development of models from read-across, (Q)SARs, PBPK, QIVIVE, Multi-Scale Models -Big Data in toxicology: integration, management, analysis -Implementation of models through AOPs, IATA, TTC -Regulatory acceptance of models: evaluation, verification and validation -From metals, to small organic molecules to nanoparticles -Pharmaceuticals, pesticides, foods, cosmetics, fine chemicals -Bringing together the views of industry, regulators, academia, NGOs