Integrating machine learning and nano-QSAR models to predict the oxidative stress potential caused by single and mixed carbon nanomaterials in algal cells.
{"title":"Integrating machine learning and nano-QSAR models to predict the oxidative stress potential caused by single and mixed carbon nanomaterials in algal cells.","authors":"Qi Qi, Zhuang Wang","doi":"10.1093/etojnl/vgae049","DOIUrl":null,"url":null,"abstract":"<p><p>In silico methods are increasingly important in predicting the ecotoxicity of engineered nanomaterials (ENMs), encompassing both individual and mixture toxicity predictions. It is widely recognized that ENMs trigger oxidative stress effects by generating intracellular reactive oxygen species (ROS), serving as a key mechanism in their cytotoxicity studies. However, existing in silico methods still face significant challenges in predicting the oxidative stress effects induced by ENMs. Herein, we utilized laboratory-derived toxicity data and machine learning methods to develop quantitative nanostructure-activity relationship (nano-QSAR) classification and regression models, aiming to predict the oxidative stress effects of five carbon nanomaterials (fullerene, graphene, graphene oxide, single-walled carbon nanotubes, and multi-walled carbon nanotubes) and their binary mixtures on Scenedesmus obliquus cells. We constructed five nano-QSAR classification models by combining zeta potential (ζP) with the C4.5 decision tree, support vector machine, artificial neural network, naive Bayes, and K-nearest neighbor algorithms. Moreover, we constructed three classification models by integrating the features including ζP, hydrodynamic diameter (DH), and specific surface area (SSA) with the logistic regression, random forest, and Adaboost algorithms. The Accuracy, Recall, Precision and harmonic mean of Precision and Recall (F1-score) values of these models were all higher than 0.600, indicating an excellent performance in distinguishing whether CNMs have the potential to generate ROS. In addition, using the ζP, DH, and SSA descriptors, we combined decision tree regression, random forest regression, gradient boosting, and the Adaboost algorithm, and successfully constructed four nano-QSAR regression models with applicable application domains (all training and testing data points lie within 95% confidence intervals), goodness-of-fit (Rtrain2 ≥ 0.850), and robustness (cross-validation R2 ≥ 0.650) as well as predictive power (Rtest2 ≥ 0.610). The method developed would establish a fundamental basis for more precise evaluations of ecological risks posed by these materials from a mechanistic standpoint.</p>","PeriodicalId":11793,"journal":{"name":"Environmental Toxicology and Chemistry","volume":" ","pages":""},"PeriodicalIF":3.6000,"publicationDate":"2025-01-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Toxicology and Chemistry","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1093/etojnl/vgae049","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
In silico methods are increasingly important in predicting the ecotoxicity of engineered nanomaterials (ENMs), encompassing both individual and mixture toxicity predictions. It is widely recognized that ENMs trigger oxidative stress effects by generating intracellular reactive oxygen species (ROS), serving as a key mechanism in their cytotoxicity studies. However, existing in silico methods still face significant challenges in predicting the oxidative stress effects induced by ENMs. Herein, we utilized laboratory-derived toxicity data and machine learning methods to develop quantitative nanostructure-activity relationship (nano-QSAR) classification and regression models, aiming to predict the oxidative stress effects of five carbon nanomaterials (fullerene, graphene, graphene oxide, single-walled carbon nanotubes, and multi-walled carbon nanotubes) and their binary mixtures on Scenedesmus obliquus cells. We constructed five nano-QSAR classification models by combining zeta potential (ζP) with the C4.5 decision tree, support vector machine, artificial neural network, naive Bayes, and K-nearest neighbor algorithms. Moreover, we constructed three classification models by integrating the features including ζP, hydrodynamic diameter (DH), and specific surface area (SSA) with the logistic regression, random forest, and Adaboost algorithms. The Accuracy, Recall, Precision and harmonic mean of Precision and Recall (F1-score) values of these models were all higher than 0.600, indicating an excellent performance in distinguishing whether CNMs have the potential to generate ROS. In addition, using the ζP, DH, and SSA descriptors, we combined decision tree regression, random forest regression, gradient boosting, and the Adaboost algorithm, and successfully constructed four nano-QSAR regression models with applicable application domains (all training and testing data points lie within 95% confidence intervals), goodness-of-fit (Rtrain2 ≥ 0.850), and robustness (cross-validation R2 ≥ 0.650) as well as predictive power (Rtest2 ≥ 0.610). The method developed would establish a fundamental basis for more precise evaluations of ecological risks posed by these materials from a mechanistic standpoint.
期刊介绍:
The Society of Environmental Toxicology and Chemistry (SETAC) publishes two journals: Environmental Toxicology and Chemistry (ET&C) and Integrated Environmental Assessment and Management (IEAM). Environmental Toxicology and Chemistry is dedicated to furthering scientific knowledge and disseminating information on environmental toxicology and chemistry, including the application of these sciences to risk assessment.[...]
Environmental Toxicology and Chemistry is interdisciplinary in scope and integrates the fields of environmental toxicology; environmental, analytical, and molecular chemistry; ecology; physiology; biochemistry; microbiology; genetics; genomics; environmental engineering; chemical, environmental, and biological modeling; epidemiology; and earth sciences. ET&C seeks to publish papers describing original experimental or theoretical work that significantly advances understanding in the area of environmental toxicology, environmental chemistry and hazard/risk assessment. Emphasis is given to papers that enhance capabilities for the prediction, measurement, and assessment of the fate and effects of chemicals in the environment, rather than simply providing additional data. The scientific impact of papers is judged in terms of the breadth and depth of the findings and the expected influence on existing or future scientific practice. Methodological papers must make clear not only how the work differs from existing practice, but the significance of these differences to the field. Site-based research or monitoring must have regional or global implications beyond the particular site, such as evaluating processes, mechanisms, or theory under a natural environmental setting.