{"title":"Using machine learning models to predict the dose–effect curve of municipal wastewater for zebrafish embryo toxicity","authors":"Mengyuan Zhu, Yushi Fang, Min Jia, Ling Chen, Linyu Zhang, Bing Wu","doi":"10.1016/j.jhazmat.2025.137278","DOIUrl":null,"url":null,"abstract":"Municipal wastewater substantially contributes to aquatic ecological risks. Assessing the toxicity of municipal wastewater through dose–effect curves is challenging owing to the time-consuming, labor-intensive, and costly nature of biological assays. This study developed machine learning models to predict wastewater dose–effect curves for zebrafish embryos. The influent and effluent samples from 176 wastewater treatment plants in China were analyzed to collect water quality data, including information on seven chemical parameters and the toxic effects on zebrafish embryos at eight relative enrichment factors (REFs) of wastewater. Using Spearman’s rank correlation coefficient and the max-relevance and min-redundancy algorithm, the parameters of ammonium nitrogen content and toxic effect values at REFs of 2 and 25 (REF2 and REF25), were identified as crucial input features from 15 variables. Decision tree, random forest, and gradient-boosted decision tree (GBDT) models were developed. Among these, GBDT exhibited the best performance, with an average R<sup>2</sup> value of 0.91 and an average mean absolute percentage error (MAPE) of 27.91%. Integrating the dose–effect curve pattern into the machine learning model considerably optimized the GBDT model, reaching a minimum MAPE of 14.74%. The developed model can accurately determine the dose–effect curves of actual wastewater, reducing at least 75% of the experimental workload. These findings provide a valuable tool for assessing zebrafish embryo toxicity in municipal wastewater management. This study indicates that combining environmental expertise and machine learning models allows for a scientific assessment of the potential toxic risks in wastewater, providing new perspectives and approaches for environmental policy development.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"22 1","pages":""},"PeriodicalIF":12.2000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.137278","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Municipal wastewater substantially contributes to aquatic ecological risks. Assessing the toxicity of municipal wastewater through dose–effect curves is challenging owing to the time-consuming, labor-intensive, and costly nature of biological assays. This study developed machine learning models to predict wastewater dose–effect curves for zebrafish embryos. The influent and effluent samples from 176 wastewater treatment plants in China were analyzed to collect water quality data, including information on seven chemical parameters and the toxic effects on zebrafish embryos at eight relative enrichment factors (REFs) of wastewater. Using Spearman’s rank correlation coefficient and the max-relevance and min-redundancy algorithm, the parameters of ammonium nitrogen content and toxic effect values at REFs of 2 and 25 (REF2 and REF25), were identified as crucial input features from 15 variables. Decision tree, random forest, and gradient-boosted decision tree (GBDT) models were developed. Among these, GBDT exhibited the best performance, with an average R2 value of 0.91 and an average mean absolute percentage error (MAPE) of 27.91%. Integrating the dose–effect curve pattern into the machine learning model considerably optimized the GBDT model, reaching a minimum MAPE of 14.74%. The developed model can accurately determine the dose–effect curves of actual wastewater, reducing at least 75% of the experimental workload. These findings provide a valuable tool for assessing zebrafish embryo toxicity in municipal wastewater management. This study indicates that combining environmental expertise and machine learning models allows for a scientific assessment of the potential toxic risks in wastewater, providing new perspectives and approaches for environmental policy development.
期刊介绍:
The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.