{"title":"探索混合指纹在广义解读中的使用及其对选定体内毒性结果的预测性能的影响","authors":"Aubrey Leary , Imran Shah , Grace Patlewicz","doi":"10.1016/j.comtox.2025.100349","DOIUrl":null,"url":null,"abstract":"<div><div>Read-across is a cost-efficient means of generating information for hazard assessment. Approaches such as Generalized Read-Across (GenRA) facilitate objective and reproducible read-across for untested substances. GenRA is a web application, and its prediction engine is also available as a python package (genra-py). Recent updates permit source analogues to be identified using ‘hybrid’ fingerprints, i.e. analogues identified based on more than one type of similarity measure. Herein, the performance of hybrid fingerprints relative to Morgan chemical fingerprints was evaluated for a selection of acute and chronic <em>in vivo</em> toxicity outcomes. Grid search and cross-validation on a dataset of 5,830 chemicals with rodent acute oral toxicity (LD<sub>50</sub>) values were used to tune the hybrid weight hyperparameter for up to four chemical fingerprints (Morgan, Torsion, ToxPrint and Analog Identification Methodology (AIM)). The optimal hybrid fingerprint derived (52.12% Morgan, 23.40% ToxPrint, 12.44% AIM, 12.04% Torsion) outperformed Morgan fingerprints across all 10 folds of a cross-validation procedure (mean test set coefficient of determination (R<sup>2</sup>) 0.517 (Morgan) vs. 0.557 (hybrid)). The hybrid fingerprint was then used to make toxicity predictions for 2 other datasets, a set of 3,266 chemicals with oral chronic human equivalent benchmark dose values (mean test set R<sup>2</sup> 0.445 vs. 0.417 for Morgan) and a set of 9,443 chemicals with acute mammalian oral hazard classifications (mean balanced accuracy (BA) 0.577 vs 0.553 for Morgan). Overall, performance improved when using the hybrid fingerprint tuned for the acute toxicity dataset. Using the custom hybrid option in GenRA results in improved read-across predictions relative to current defaults.</div></div>","PeriodicalId":37651,"journal":{"name":"Computational Toxicology","volume":"34 ","pages":"Article 100349"},"PeriodicalIF":3.1000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An exploration of the use of hybrid fingerprints in Generalized Read-Across and their impact on predictive performance for selected in vivo toxicity outcomes\",\"authors\":\"Aubrey Leary , Imran Shah , Grace Patlewicz\",\"doi\":\"10.1016/j.comtox.2025.100349\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Read-across is a cost-efficient means of generating information for hazard assessment. Approaches such as Generalized Read-Across (GenRA) facilitate objective and reproducible read-across for untested substances. GenRA is a web application, and its prediction engine is also available as a python package (genra-py). Recent updates permit source analogues to be identified using ‘hybrid’ fingerprints, i.e. analogues identified based on more than one type of similarity measure. Herein, the performance of hybrid fingerprints relative to Morgan chemical fingerprints was evaluated for a selection of acute and chronic <em>in vivo</em> toxicity outcomes. Grid search and cross-validation on a dataset of 5,830 chemicals with rodent acute oral toxicity (LD<sub>50</sub>) values were used to tune the hybrid weight hyperparameter for up to four chemical fingerprints (Morgan, Torsion, ToxPrint and Analog Identification Methodology (AIM)). The optimal hybrid fingerprint derived (52.12% Morgan, 23.40% ToxPrint, 12.44% AIM, 12.04% Torsion) outperformed Morgan fingerprints across all 10 folds of a cross-validation procedure (mean test set coefficient of determination (R<sup>2</sup>) 0.517 (Morgan) vs. 0.557 (hybrid)). The hybrid fingerprint was then used to make toxicity predictions for 2 other datasets, a set of 3,266 chemicals with oral chronic human equivalent benchmark dose values (mean test set R<sup>2</sup> 0.445 vs. 0.417 for Morgan) and a set of 9,443 chemicals with acute mammalian oral hazard classifications (mean balanced accuracy (BA) 0.577 vs 0.553 for Morgan). Overall, performance improved when using the hybrid fingerprint tuned for the acute toxicity dataset. Using the custom hybrid option in GenRA results in improved read-across predictions relative to current defaults.</div></div>\",\"PeriodicalId\":37651,\"journal\":{\"name\":\"Computational Toxicology\",\"volume\":\"34 \",\"pages\":\"Article 100349\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational Toxicology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S246811132500009X\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"TOXICOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Toxicology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S246811132500009X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"TOXICOLOGY","Score":null,"Total":0}
An exploration of the use of hybrid fingerprints in Generalized Read-Across and their impact on predictive performance for selected in vivo toxicity outcomes
Read-across is a cost-efficient means of generating information for hazard assessment. Approaches such as Generalized Read-Across (GenRA) facilitate objective and reproducible read-across for untested substances. GenRA is a web application, and its prediction engine is also available as a python package (genra-py). Recent updates permit source analogues to be identified using ‘hybrid’ fingerprints, i.e. analogues identified based on more than one type of similarity measure. Herein, the performance of hybrid fingerprints relative to Morgan chemical fingerprints was evaluated for a selection of acute and chronic in vivo toxicity outcomes. Grid search and cross-validation on a dataset of 5,830 chemicals with rodent acute oral toxicity (LD50) values were used to tune the hybrid weight hyperparameter for up to four chemical fingerprints (Morgan, Torsion, ToxPrint and Analog Identification Methodology (AIM)). The optimal hybrid fingerprint derived (52.12% Morgan, 23.40% ToxPrint, 12.44% AIM, 12.04% Torsion) outperformed Morgan fingerprints across all 10 folds of a cross-validation procedure (mean test set coefficient of determination (R2) 0.517 (Morgan) vs. 0.557 (hybrid)). The hybrid fingerprint was then used to make toxicity predictions for 2 other datasets, a set of 3,266 chemicals with oral chronic human equivalent benchmark dose values (mean test set R2 0.445 vs. 0.417 for Morgan) and a set of 9,443 chemicals with acute mammalian oral hazard classifications (mean balanced accuracy (BA) 0.577 vs 0.553 for Morgan). Overall, performance improved when using the hybrid fingerprint tuned for the acute toxicity dataset. Using the custom hybrid option in GenRA results in improved read-across predictions relative to current defaults.
期刊介绍:
Computational Toxicology is an international journal publishing computational approaches that assist in the toxicological evaluation of new and existing chemical substances assisting in their safety assessment. -All effects relating to human health and environmental toxicity and fate -Prediction of toxicity, metabolism, fate and physico-chemical properties -The development of models from read-across, (Q)SARs, PBPK, QIVIVE, Multi-Scale Models -Big Data in toxicology: integration, management, analysis -Implementation of models through AOPs, IATA, TTC -Regulatory acceptance of models: evaluation, verification and validation -From metals, to small organic molecules to nanoparticles -Pharmaceuticals, pesticides, foods, cosmetics, fine chemicals -Bringing together the views of industry, regulators, academia, NGOs