Ani Tevosyan , Hrach Yeghiazaryan , Gohar Tadevosyan , Lilit Apresyan , Vahe Atoyan , Anna Misakyan , Zaven Navoyan , Helga Stopper , Nelly Babayan , Lusine Khondkaryan
{"title":"AI/ML modeling to enhance the capability of in vitro and in vivo tests in predicting human carcinogenicity","authors":"Ani Tevosyan , Hrach Yeghiazaryan , Gohar Tadevosyan , Lilit Apresyan , Vahe Atoyan , Anna Misakyan , Zaven Navoyan , Helga Stopper , Nelly Babayan , Lusine Khondkaryan","doi":"10.1016/j.mrgentox.2025.503858","DOIUrl":null,"url":null,"abstract":"<div><div>This study aimed to develop an in silico model for predicting human carcinogenicity using advanced deep learning techniques, specifically Graph Neural Networks (GNN), through a multitask learning (MTL) approach. The MTL framework leveraged auxiliary tasks, including mutagenicity, genotoxicity, animal carcinogenicity, androgen and estrogen receptor binding, to enhance the model's predictive capabilities for the primary task of human carcinogenicity. Three distinct GNN architectures were used alongside various combinations of auxiliary tasks to evaluate the variations in performance metrics. Results demonstrated that multitask learning significantly enhances the predictive performance of GNN models compared to single-task learning for predicting human carcinogenicity. The best performed MTL model achieved an area under the curve of 0.89, along with a balanced accuracy of 82 %, and sensitivity and specificity values of 0.75 and 0.89, respectively. The developed multitask learning (MTL) models function on tasks that represent assays for identifying both genotoxic and non-genotoxic carcinogens, thereby enhancing the model's capability to predict human carcinogenic risk with greater accuracy. The advanced GNN models demonstrated effectiveness in addressing data imbalance issues frequently observed in biological datasets, mitigating the bias that typically favors one class over another. Overall, these results underscore the promise of GNN-based MTL models for reliable chemical screening and prioritization, particularly in predicting human carcinogenicity.</div></div>","PeriodicalId":18799,"journal":{"name":"Mutation research. Genetic toxicology and environmental mutagenesis","volume":"903 ","pages":"Article 503858"},"PeriodicalIF":2.3000,"publicationDate":"2025-02-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mutation research. Genetic toxicology and environmental mutagenesis","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1383571825000178","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOTECHNOLOGY & APPLIED MICROBIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
This study aimed to develop an in silico model for predicting human carcinogenicity using advanced deep learning techniques, specifically Graph Neural Networks (GNN), through a multitask learning (MTL) approach. The MTL framework leveraged auxiliary tasks, including mutagenicity, genotoxicity, animal carcinogenicity, androgen and estrogen receptor binding, to enhance the model's predictive capabilities for the primary task of human carcinogenicity. Three distinct GNN architectures were used alongside various combinations of auxiliary tasks to evaluate the variations in performance metrics. Results demonstrated that multitask learning significantly enhances the predictive performance of GNN models compared to single-task learning for predicting human carcinogenicity. The best performed MTL model achieved an area under the curve of 0.89, along with a balanced accuracy of 82 %, and sensitivity and specificity values of 0.75 and 0.89, respectively. The developed multitask learning (MTL) models function on tasks that represent assays for identifying both genotoxic and non-genotoxic carcinogens, thereby enhancing the model's capability to predict human carcinogenic risk with greater accuracy. The advanced GNN models demonstrated effectiveness in addressing data imbalance issues frequently observed in biological datasets, mitigating the bias that typically favors one class over another. Overall, these results underscore the promise of GNN-based MTL models for reliable chemical screening and prioritization, particularly in predicting human carcinogenicity.
期刊介绍:
Mutation Research - Genetic Toxicology and Environmental Mutagenesis (MRGTEM) publishes papers advancing knowledge in the field of genetic toxicology. Papers are welcomed in the following areas:
New developments in genotoxicity testing of chemical agents (e.g. improvements in methodology of assay systems and interpretation of results).
Alternatives to and refinement of the use of animals in genotoxicity testing.
Nano-genotoxicology, the study of genotoxicity hazards and risks related to novel man-made nanomaterials.
Studies of epigenetic changes in relation to genotoxic effects.
The use of structure-activity relationships in predicting genotoxic effects.
The isolation and chemical characterization of novel environmental mutagens.
The measurement of genotoxic effects in human populations, when accompanied by quantitative measurements of environmental or occupational exposures.
The application of novel technologies for assessing the hazard and risks associated with genotoxic substances (e.g. OMICS or other high-throughput approaches to genotoxicity testing).
MRGTEM is now accepting submissions for a new section of the journal: Current Topics in Genotoxicity Testing, that will be dedicated to the discussion of current issues relating to design, interpretation and strategic use of genotoxicity tests. This section is envisaged to include discussions relating to the development of new international testing guidelines, but also to wider topics in the field. The evaluation of contrasting or opposing viewpoints is welcomed as long as the presentation is in accordance with the journal''s aims, scope, and policies.