机器学习支持的土壤重金属特定地点自然背景值测定

IF 11.3 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Jian Wu, Chengmin Huang
{"title":"机器学习支持的土壤重金属特定地点自然背景值测定","authors":"Jian Wu, Chengmin Huang","doi":"10.1016/j.jhazmat.2025.137276","DOIUrl":null,"url":null,"abstract":"Heavy metal natural background values play a crucial role in distinguishing anthropogenic sources from natural sources to assess human impacts in polluted areas, thereby accurately formulating environmental policies. However, due to limitations imposed by human activities, research methods, and regional constraints, the determination of heavy metal background values, particularly on site or profile scale, is often challenging, highlighting the urgent need for new methodologies. To establish a comprehensive dataset containing heavy metal concentrations and soil properties, the study systematically collected and screened 82 soil profiles from areas minimally affected by human activities, resulting in a total of 2,185 data sets. Using soil depth, pH, organic matter, weathering indices (SAF, BA), Fe<sub>2</sub>O<sub>3</sub>, MgO, Na<sub>2</sub>O, CaO, and K<sub>2</sub>O as model input variables, the predictive performance for site-specific background levels of Cd, Cr, Cu, Ni, Pb, and Zn was compared across four advanced machine learning models (RF (random forest), XGBoost (extreme gradient boosting), ANN (artificial neural network), SVR (support vector regression)). The results indicated that the optimal model for predicting background values of Cd, Cr, and Ni was XGBoost (MAE = 0.14 – 0.17; MSE = 0.04 – 0.06; R² = 0.82 – 0.87), while RF was used for Cu, Pb, and Zn (MAE = 0.01 – 0.18; MSE = 0.02 – 0.06; R² = 0.89 – 0.95). Importance assessments using RF and SHAP revealed that pH is a key controlling factor for Cd and Ni, Fe<sub>2</sub>O<sub>3</sub> significantly impacts Cr, Cu, and Zn background levels, and K<sub>2</sub>O is the main controlling factor for Pb. The machine learning models developed can effectively predict the background levels of these six heavy metals based on major elemental and soil physicochemical properties, particularly achieving accurate predictions for Cu and Zn using just two input variables. This machine learning prediction framework is based on major elemental compositions and the physical/chemical properties of soil, enables precise and cost-effective point-to-point environmental assessments, thereby offering significant potential for practical applications.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"51 1","pages":""},"PeriodicalIF":11.3000,"publicationDate":"2025-01-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-supported determination for site-specific natural background values of soil heavy metals\",\"authors\":\"Jian Wu, Chengmin Huang\",\"doi\":\"10.1016/j.jhazmat.2025.137276\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Heavy metal natural background values play a crucial role in distinguishing anthropogenic sources from natural sources to assess human impacts in polluted areas, thereby accurately formulating environmental policies. However, due to limitations imposed by human activities, research methods, and regional constraints, the determination of heavy metal background values, particularly on site or profile scale, is often challenging, highlighting the urgent need for new methodologies. To establish a comprehensive dataset containing heavy metal concentrations and soil properties, the study systematically collected and screened 82 soil profiles from areas minimally affected by human activities, resulting in a total of 2,185 data sets. Using soil depth, pH, organic matter, weathering indices (SAF, BA), Fe<sub>2</sub>O<sub>3</sub>, MgO, Na<sub>2</sub>O, CaO, and K<sub>2</sub>O as model input variables, the predictive performance for site-specific background levels of Cd, Cr, Cu, Ni, Pb, and Zn was compared across four advanced machine learning models (RF (random forest), XGBoost (extreme gradient boosting), ANN (artificial neural network), SVR (support vector regression)). The results indicated that the optimal model for predicting background values of Cd, Cr, and Ni was XGBoost (MAE = 0.14 – 0.17; MSE = 0.04 – 0.06; R² = 0.82 – 0.87), while RF was used for Cu, Pb, and Zn (MAE = 0.01 – 0.18; MSE = 0.02 – 0.06; R² = 0.89 – 0.95). Importance assessments using RF and SHAP revealed that pH is a key controlling factor for Cd and Ni, Fe<sub>2</sub>O<sub>3</sub> significantly impacts Cr, Cu, and Zn background levels, and K<sub>2</sub>O is the main controlling factor for Pb. The machine learning models developed can effectively predict the background levels of these six heavy metals based on major elemental and soil physicochemical properties, particularly achieving accurate predictions for Cu and Zn using just two input variables. This machine learning prediction framework is based on major elemental compositions and the physical/chemical properties of soil, enables precise and cost-effective point-to-point environmental assessments, thereby offering significant potential for practical applications.\",\"PeriodicalId\":361,\"journal\":{\"name\":\"Journal of Hazardous Materials\",\"volume\":\"51 1\",\"pages\":\"\"},\"PeriodicalIF\":11.3000,\"publicationDate\":\"2025-01-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hazardous Materials\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jhazmat.2025.137276\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.137276","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

重金属自然背景值对于区分人为来源和自然来源,评估污染地区的人类影响,从而准确制定环境政策具有至关重要的作用。然而,由于人类活动、研究方法和区域限制的限制,重金属背景值的确定,特别是在现场或剖面尺度上的确定,往往具有挑战性,迫切需要新的方法。为了建立一个包含重金属浓度和土壤性质的综合数据集,该研究系统地收集和筛选了82个受人类活动影响最小的地区的土壤剖面,总共产生了2185个数据集。以土壤深度、pH值、有机质、风化指数(SAF、BA)、Fe2O3、MgO、Na2O、CaO和K2O为模型输入变量,比较了四种先进的机器学习模型(RF(随机森林)、XGBoost(极端梯度增强)、ANN(人工神经网络)和SVR(支持向量回归))对特定地点Cd、Cr、Cu、Ni、Pb和Zn背景水平的预测性能。结果表明,预测Cd、Cr、Ni背景值的最佳模型为XGBoost (MAE = 0.14 ~ 0.17;Mse = 0.04 ~ 0.06;R²= 0.82 ~ 0.87),RF用于Cu、Pb和Zn (MAE = 0.01 ~ 0.18;Mse = 0.02 ~ 0.06;R²= 0.89 - 0.95)。利用RF和SHAP进行的重要性评价表明,pH是Cd和Ni的关键控制因素,Fe2O3显著影响Cr、Cu和Zn的背景水平,K2O是Pb的主要控制因素。开发的机器学习模型可以根据主要元素和土壤物理化学性质有效地预测这六种重金属的背景水平,特别是仅使用两个输入变量就可以准确预测Cu和Zn。这种机器学习预测框架基于土壤的主要元素组成和物理/化学性质,能够进行精确且具有成本效益的点对点环境评估,从而为实际应用提供了巨大的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Machine learning-supported determination for site-specific natural background values of soil heavy metals

Machine learning-supported determination for site-specific natural background values of soil heavy metals
Heavy metal natural background values play a crucial role in distinguishing anthropogenic sources from natural sources to assess human impacts in polluted areas, thereby accurately formulating environmental policies. However, due to limitations imposed by human activities, research methods, and regional constraints, the determination of heavy metal background values, particularly on site or profile scale, is often challenging, highlighting the urgent need for new methodologies. To establish a comprehensive dataset containing heavy metal concentrations and soil properties, the study systematically collected and screened 82 soil profiles from areas minimally affected by human activities, resulting in a total of 2,185 data sets. Using soil depth, pH, organic matter, weathering indices (SAF, BA), Fe2O3, MgO, Na2O, CaO, and K2O as model input variables, the predictive performance for site-specific background levels of Cd, Cr, Cu, Ni, Pb, and Zn was compared across four advanced machine learning models (RF (random forest), XGBoost (extreme gradient boosting), ANN (artificial neural network), SVR (support vector regression)). The results indicated that the optimal model for predicting background values of Cd, Cr, and Ni was XGBoost (MAE = 0.14 – 0.17; MSE = 0.04 – 0.06; R² = 0.82 – 0.87), while RF was used for Cu, Pb, and Zn (MAE = 0.01 – 0.18; MSE = 0.02 – 0.06; R² = 0.89 – 0.95). Importance assessments using RF and SHAP revealed that pH is a key controlling factor for Cd and Ni, Fe2O3 significantly impacts Cr, Cu, and Zn background levels, and K2O is the main controlling factor for Pb. The machine learning models developed can effectively predict the background levels of these six heavy metals based on major elemental and soil physicochemical properties, particularly achieving accurate predictions for Cu and Zn using just two input variables. This machine learning prediction framework is based on major elemental compositions and the physical/chemical properties of soil, enables precise and cost-effective point-to-point environmental assessments, thereby offering significant potential for practical applications.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Hazardous Materials
Journal of Hazardous Materials 工程技术-工程:环境
CiteScore
25.40
自引率
5.90%
发文量
3059
审稿时长
58 days
期刊介绍: The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信