Using machine learning models to predict the dose–effect curve of municipal wastewater for zebrafish embryo toxicity

IF 12.2 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Mengyuan Zhu, Yushi Fang, Min Jia, Ling Chen, Linyu Zhang, Bing Wu
{"title":"Using machine learning models to predict the dose–effect curve of municipal wastewater for zebrafish embryo toxicity","authors":"Mengyuan Zhu, Yushi Fang, Min Jia, Ling Chen, Linyu Zhang, Bing Wu","doi":"10.1016/j.jhazmat.2025.137278","DOIUrl":null,"url":null,"abstract":"Municipal wastewater substantially contributes to aquatic ecological risks. Assessing the toxicity of municipal wastewater through dose–effect curves is challenging owing to the time-consuming, labor-intensive, and costly nature of biological assays. This study developed machine learning models to predict wastewater dose–effect curves for zebrafish embryos. The influent and effluent samples from 176 wastewater treatment plants in China were analyzed to collect water quality data, including information on seven chemical parameters and the toxic effects on zebrafish embryos at eight relative enrichment factors (REFs) of wastewater. Using Spearman’s rank correlation coefficient and the max-relevance and min-redundancy algorithm, the parameters of ammonium nitrogen content and toxic effect values at REFs of 2 and 25 (REF2 and REF25), were identified as crucial input features from 15 variables. Decision tree, random forest, and gradient-boosted decision tree (GBDT) models were developed. Among these, GBDT exhibited the best performance, with an average R<sup>2</sup> value of 0.91 and an average mean absolute percentage error (MAPE) of 27.91%. Integrating the dose–effect curve pattern into the machine learning model considerably optimized the GBDT model, reaching a minimum MAPE of 14.74%. The developed model can accurately determine the dose–effect curves of actual wastewater, reducing at least 75% of the experimental workload. These findings provide a valuable tool for assessing zebrafish embryo toxicity in municipal wastewater management. This study indicates that combining environmental expertise and machine learning models allows for a scientific assessment of the potential toxic risks in wastewater, providing new perspectives and approaches for environmental policy development.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"22 1","pages":""},"PeriodicalIF":12.2000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.137278","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Municipal wastewater substantially contributes to aquatic ecological risks. Assessing the toxicity of municipal wastewater through dose–effect curves is challenging owing to the time-consuming, labor-intensive, and costly nature of biological assays. This study developed machine learning models to predict wastewater dose–effect curves for zebrafish embryos. The influent and effluent samples from 176 wastewater treatment plants in China were analyzed to collect water quality data, including information on seven chemical parameters and the toxic effects on zebrafish embryos at eight relative enrichment factors (REFs) of wastewater. Using Spearman’s rank correlation coefficient and the max-relevance and min-redundancy algorithm, the parameters of ammonium nitrogen content and toxic effect values at REFs of 2 and 25 (REF2 and REF25), were identified as crucial input features from 15 variables. Decision tree, random forest, and gradient-boosted decision tree (GBDT) models were developed. Among these, GBDT exhibited the best performance, with an average R2 value of 0.91 and an average mean absolute percentage error (MAPE) of 27.91%. Integrating the dose–effect curve pattern into the machine learning model considerably optimized the GBDT model, reaching a minimum MAPE of 14.74%. The developed model can accurately determine the dose–effect curves of actual wastewater, reducing at least 75% of the experimental workload. These findings provide a valuable tool for assessing zebrafish embryo toxicity in municipal wastewater management. This study indicates that combining environmental expertise and machine learning models allows for a scientific assessment of the potential toxic risks in wastewater, providing new perspectives and approaches for environmental policy development.

Abstract Image

利用机器学习模型预测城市污水对斑马鱼胚胎毒性的剂量效应曲线
城市污水极大地加剧了水生生态风险。由于生物检测耗时、耗力且成本高昂,通过剂量效应曲线评估城市污水的毒性具有挑战性。本研究开发了机器学习模型来预测废水对斑马鱼胚胎的剂量效应曲线。该研究分析了中国 176 个污水处理厂的进水和出水样本,收集了水质数据,包括 7 个化学参数信息以及污水在 8 个相对富集因子(REF)下对斑马鱼胚胎的毒性影响。利用斯皮尔曼秩相关系数以及最大相关性和最小冗余度算法,从 15 个变量中识别出铵态氮含量参数和相对富集因子为 2 和 25(REF2 和 REF25)时的毒性效应值作为关键输入特征。开发了决策树、随机森林和梯度增强决策树(GBDT)模型。其中,GBDT 表现最佳,平均 R2 值为 0.91,平均绝对百分比误差 (MAPE) 为 27.91%。将剂量效应曲线模式整合到机器学习模型中大大优化了 GBDT 模型,使 MAPE 最小达到 14.74%。所开发的模型可以准确确定实际废水的剂量效应曲线,至少减少了 75% 的实验工作量。这些发现为评估城市污水管理中斑马鱼胚胎的毒性提供了宝贵的工具。这项研究表明,将环境专业知识与机器学习模型相结合,可以对废水中的潜在毒性风险进行科学评估,为环境政策的制定提供新的视角和方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Hazardous Materials
Journal of Hazardous Materials 工程技术-工程:环境
CiteScore
25.40
自引率
5.90%
发文量
3059
审稿时长
58 days
期刊介绍: The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信