Junfan Zhang , Yuzhi Xuan , Jingjing Lei , Liping Bai , Guobang Zhou , Yuelong Mao , Peinian Gong , Menghuan Zhang , Dajian Pan
{"title":"基于在线传感器和机器学习的地下水重金属预测系统——以典型工业园区为例","authors":"Junfan Zhang , Yuzhi Xuan , Jingjing Lei , Liping Bai , Guobang Zhou , Yuelong Mao , Peinian Gong , Menghuan Zhang , Dajian Pan","doi":"10.1016/j.envpol.2025.126270","DOIUrl":null,"url":null,"abstract":"<div><div>With the expansion of human industrial activities, heavy metal contamination in groundwater environments has become increasingly severe. Environmental management agencies invest significant financial resources into groundwater monitoring, primarily due to its inherent invisibility. Automatic monitoring is a new way to monitor groundwater, the existing sensors often can only achieve simple indicators, and it is difficult to achieve complex indicators such as heavy metals. This study integrated pH and conductivity online monitoring probes with machine learning algorithms to develop a real-time, automated heavy metal prediction system for groundwater. The predictive performance demonstrated that the highest R<sup>2</sup> values for chromium (Cr), nickel (Ni), and copper (Cu) were 0.73, 0.78, and 0.87, respectively, with mean absolute errors of 11.9, 0.83, and 1.02 μg/L. While random forest and extreme gradient boosting (XGB) models demonstrate greater robustness. To enhance the practicality and management significance of the prediction system, interval prediction is employed. Uncertainty assessment results indicate that the performance order of prediction intervals across different models is XGB > Random Forest > Multiple Linear Regression (MLR) > Backpropagation neural network (BP). We proposed that Groundwater risk is acceptable when the prediction interval of pollutants falls below regional screening levels. The integration of automated sensors with machine learning algorithms can offer advanced recommendations for long-term environmental monitoring.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"374 ","pages":"Article 126270"},"PeriodicalIF":7.6000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Heavy metals prediction system in groundwater using online sensor and machine learning for water management: the case of typical industrial park\",\"authors\":\"Junfan Zhang , Yuzhi Xuan , Jingjing Lei , Liping Bai , Guobang Zhou , Yuelong Mao , Peinian Gong , Menghuan Zhang , Dajian Pan\",\"doi\":\"10.1016/j.envpol.2025.126270\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>With the expansion of human industrial activities, heavy metal contamination in groundwater environments has become increasingly severe. Environmental management agencies invest significant financial resources into groundwater monitoring, primarily due to its inherent invisibility. Automatic monitoring is a new way to monitor groundwater, the existing sensors often can only achieve simple indicators, and it is difficult to achieve complex indicators such as heavy metals. This study integrated pH and conductivity online monitoring probes with machine learning algorithms to develop a real-time, automated heavy metal prediction system for groundwater. The predictive performance demonstrated that the highest R<sup>2</sup> values for chromium (Cr), nickel (Ni), and copper (Cu) were 0.73, 0.78, and 0.87, respectively, with mean absolute errors of 11.9, 0.83, and 1.02 μg/L. While random forest and extreme gradient boosting (XGB) models demonstrate greater robustness. To enhance the practicality and management significance of the prediction system, interval prediction is employed. Uncertainty assessment results indicate that the performance order of prediction intervals across different models is XGB > Random Forest > Multiple Linear Regression (MLR) > Backpropagation neural network (BP). We proposed that Groundwater risk is acceptable when the prediction interval of pollutants falls below regional screening levels. The integration of automated sensors with machine learning algorithms can offer advanced recommendations for long-term environmental monitoring.</div></div>\",\"PeriodicalId\":311,\"journal\":{\"name\":\"Environmental Pollution\",\"volume\":\"374 \",\"pages\":\"Article 126270\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-04-22\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Pollution\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0269749125006438\",\"RegionNum\":2,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0269749125006438","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
Heavy metals prediction system in groundwater using online sensor and machine learning for water management: the case of typical industrial park
With the expansion of human industrial activities, heavy metal contamination in groundwater environments has become increasingly severe. Environmental management agencies invest significant financial resources into groundwater monitoring, primarily due to its inherent invisibility. Automatic monitoring is a new way to monitor groundwater, the existing sensors often can only achieve simple indicators, and it is difficult to achieve complex indicators such as heavy metals. This study integrated pH and conductivity online monitoring probes with machine learning algorithms to develop a real-time, automated heavy metal prediction system for groundwater. The predictive performance demonstrated that the highest R2 values for chromium (Cr), nickel (Ni), and copper (Cu) were 0.73, 0.78, and 0.87, respectively, with mean absolute errors of 11.9, 0.83, and 1.02 μg/L. While random forest and extreme gradient boosting (XGB) models demonstrate greater robustness. To enhance the practicality and management significance of the prediction system, interval prediction is employed. Uncertainty assessment results indicate that the performance order of prediction intervals across different models is XGB > Random Forest > Multiple Linear Regression (MLR) > Backpropagation neural network (BP). We proposed that Groundwater risk is acceptable when the prediction interval of pollutants falls below regional screening levels. The integration of automated sensors with machine learning algorithms can offer advanced recommendations for long-term environmental monitoring.
期刊介绍:
Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health.
Subject areas include, but are not limited to:
• Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies;
• Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change;
• Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects;
• Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects;
• Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest;
• New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.