{"title":"知识嵌入和可解释机器学习优化了水处理的综合效益","authors":"Yu-Qi Wang, Wenchong Tian, Hao-Lin Yang, Yun-Peng Song, Jia-Ji Chen, Qiong-Ying Xu, Wan-Xin Yin, Le-Qi Ding, Xi-Qi Li, Han-Tao Wang, Ai-Jie Wang, Hong-Cheng Wang","doi":"10.1038/s41545-025-00510-1","DOIUrl":null,"url":null,"abstract":"<p>Perikinetic and orthokinetic flocculation are the first steps in drinking water treatment plant (DWTP) and affect all subsequent processes. Leveraging multi-stage water quality parameters, we developed a machine learning (ML) framework for coagulation control that incorporates knowledge embedding (KE) through hyper-parametric constraints on threshold water quality, energy consumption, and economic costs. Random forest (RF) has the best performance among the eight methods with a percentage error of 2.53% and a coefficient of determination of 0.9922. The results of the interpretability analysis show that the model can accurately identify the coagulation demand and balance the removal effect with the energy consumption and economic cost. Through real experimental validation and simulation extrapolation, the RF-KE model can reduce turbidity by 16.36% and dosing cost by 9.64%. This framework reduces economic costs while optimizing water quality through KE and interpretability analyses, providing evidence for the safe and reliable application of future models.</p>","PeriodicalId":19375,"journal":{"name":"npj Clean Water","volume":"10 1","pages":""},"PeriodicalIF":11.4000,"publicationDate":"2025-08-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Knowledge embedding and interpretable machine learning optimize comprehensive benefits for water treatment\",\"authors\":\"Yu-Qi Wang, Wenchong Tian, Hao-Lin Yang, Yun-Peng Song, Jia-Ji Chen, Qiong-Ying Xu, Wan-Xin Yin, Le-Qi Ding, Xi-Qi Li, Han-Tao Wang, Ai-Jie Wang, Hong-Cheng Wang\",\"doi\":\"10.1038/s41545-025-00510-1\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>Perikinetic and orthokinetic flocculation are the first steps in drinking water treatment plant (DWTP) and affect all subsequent processes. Leveraging multi-stage water quality parameters, we developed a machine learning (ML) framework for coagulation control that incorporates knowledge embedding (KE) through hyper-parametric constraints on threshold water quality, energy consumption, and economic costs. Random forest (RF) has the best performance among the eight methods with a percentage error of 2.53% and a coefficient of determination of 0.9922. The results of the interpretability analysis show that the model can accurately identify the coagulation demand and balance the removal effect with the energy consumption and economic cost. Through real experimental validation and simulation extrapolation, the RF-KE model can reduce turbidity by 16.36% and dosing cost by 9.64%. This framework reduces economic costs while optimizing water quality through KE and interpretability analyses, providing evidence for the safe and reliable application of future models.</p>\",\"PeriodicalId\":19375,\"journal\":{\"name\":\"npj Clean Water\",\"volume\":\"10 1\",\"pages\":\"\"},\"PeriodicalIF\":11.4000,\"publicationDate\":\"2025-08-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"npj Clean Water\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1038/s41545-025-00510-1\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, CHEMICAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"npj Clean Water","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1038/s41545-025-00510-1","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CHEMICAL","Score":null,"Total":0}
Knowledge embedding and interpretable machine learning optimize comprehensive benefits for water treatment
Perikinetic and orthokinetic flocculation are the first steps in drinking water treatment plant (DWTP) and affect all subsequent processes. Leveraging multi-stage water quality parameters, we developed a machine learning (ML) framework for coagulation control that incorporates knowledge embedding (KE) through hyper-parametric constraints on threshold water quality, energy consumption, and economic costs. Random forest (RF) has the best performance among the eight methods with a percentage error of 2.53% and a coefficient of determination of 0.9922. The results of the interpretability analysis show that the model can accurately identify the coagulation demand and balance the removal effect with the energy consumption and economic cost. Through real experimental validation and simulation extrapolation, the RF-KE model can reduce turbidity by 16.36% and dosing cost by 9.64%. This framework reduces economic costs while optimizing water quality through KE and interpretability analyses, providing evidence for the safe and reliable application of future models.
npj Clean WaterEnvironmental Science-Water Science and Technology
CiteScore
15.30
自引率
2.60%
发文量
61
审稿时长
5 weeks
期刊介绍:
npj Clean Water publishes high-quality papers that report cutting-edge science, technology, applications, policies, and societal issues contributing to a more sustainable supply of clean water. The journal's publications may also support and accelerate the achievement of Sustainable Development Goal 6, which focuses on clean water and sanitation.