利用先进的机器学习预测土壤镉、铅和砷的生物可及性,以确定中国大陆尺度的土壤环境标准

Kunting Xie, Jiajun Ou, Minghao He, Weijie Peng and Yong Yuan*, 
{"title":"利用先进的机器学习预测土壤镉、铅和砷的生物可及性,以确定中国大陆尺度的土壤环境标准","authors":"Kunting Xie,&nbsp;Jiajun Ou,&nbsp;Minghao He,&nbsp;Weijie Peng and Yong Yuan*,&nbsp;","doi":"10.1021/envhealth.4c0003510.1021/envhealth.4c00035","DOIUrl":null,"url":null,"abstract":"<p >Investigating the bioaccessibility of harmful inorganic elements in soil is crucial for understanding their behavior in the environment and accurately assessing the environmental risks associated with soil. Traditional batch experimental methods and linear models, however, are time-consuming and often fall short in precisely quantifying bioaccessibility. In this study, using 937 data points gathered from 56 journal articles, we developed machine learning models for three harmful inorganic elements, namely, Cd, Pb, and As. After thorough analysis, the model optimized through a boosting ensemble strategy demonstrated the best performance, with an average <i>R</i><sup>2</sup> of 0.95 and an RMSE of 0.25. We further employed SHAP values in conjunction with quantitative analysis to identify the key features that influence bioaccessibility. By utilizing the developed integrated models, we carried out predictions for 3002 data points across China, clarifying the bioaccessibility of cadmium (Cd), lead (Pb), and arsenic (As) in the soils of various sites and constructed a comprehensive spatial distribution map of China using the inverse distance weighting (IDW) interpolation method. Based on these findings, we further derived the soil environmental standards for metallurgical sites in China. Our observations from the collected data indicate a reduction in the number of sites exceeding the standard levels for Cd, Pb, and As in mining/smelting sites from 5, 58, and 14 to 1, 24, and 7, respectively. This research offers a precise and scientific approach for cross-regional risk assessment at the continental scale and lays a solid foundation for soil environmental management.</p>","PeriodicalId":29795,"journal":{"name":"Environment & Health","volume":"2 9","pages":"631–641 631–641"},"PeriodicalIF":0.0000,"publicationDate":"2024-06-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/envhealth.4c00035","citationCount":"0","resultStr":"{\"title\":\"Predicting the Bioaccessibility of Soil Cd, Pb, and As with Advanced Machine Learning for Continental-Scale Soil Environmental Criteria Determination in China\",\"authors\":\"Kunting Xie,&nbsp;Jiajun Ou,&nbsp;Minghao He,&nbsp;Weijie Peng and Yong Yuan*,&nbsp;\",\"doi\":\"10.1021/envhealth.4c0003510.1021/envhealth.4c00035\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Investigating the bioaccessibility of harmful inorganic elements in soil is crucial for understanding their behavior in the environment and accurately assessing the environmental risks associated with soil. Traditional batch experimental methods and linear models, however, are time-consuming and often fall short in precisely quantifying bioaccessibility. In this study, using 937 data points gathered from 56 journal articles, we developed machine learning models for three harmful inorganic elements, namely, Cd, Pb, and As. After thorough analysis, the model optimized through a boosting ensemble strategy demonstrated the best performance, with an average <i>R</i><sup>2</sup> of 0.95 and an RMSE of 0.25. We further employed SHAP values in conjunction with quantitative analysis to identify the key features that influence bioaccessibility. By utilizing the developed integrated models, we carried out predictions for 3002 data points across China, clarifying the bioaccessibility of cadmium (Cd), lead (Pb), and arsenic (As) in the soils of various sites and constructed a comprehensive spatial distribution map of China using the inverse distance weighting (IDW) interpolation method. Based on these findings, we further derived the soil environmental standards for metallurgical sites in China. Our observations from the collected data indicate a reduction in the number of sites exceeding the standard levels for Cd, Pb, and As in mining/smelting sites from 5, 58, and 14 to 1, 24, and 7, respectively. This research offers a precise and scientific approach for cross-regional risk assessment at the continental scale and lays a solid foundation for soil environmental management.</p>\",\"PeriodicalId\":29795,\"journal\":{\"name\":\"Environment & Health\",\"volume\":\"2 9\",\"pages\":\"631–641 631–641\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-06-06\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.acs.org/doi/epdf/10.1021/envhealth.4c00035\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environment & Health\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://pubs.acs.org/doi/10.1021/envhealth.4c00035\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment & Health","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/envhealth.4c00035","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

调查土壤中有害无机元素的生物可及性对于了解其在环境中的行为以及准确评估与土壤相关的环境风险至关重要。然而,传统的批量实验方法和线性模型耗时较长,而且往往无法精确量化生物可及性。在本研究中,我们利用从 56 篇期刊论文中收集到的 937 个数据点,开发了针对镉、铅和砷这三种有害无机元素的机器学习模型。经过全面分析,通过提升集合策略优化的模型表现最佳,平均 R2 为 0.95,RMSE 为 0.25。我们进一步利用 SHAP 值与定量分析相结合,确定了影响生物可及性的关键特征。利用所开发的综合模型,我们对全国 3002 个数据点进行了预测,明确了不同地点土壤中镉 (Cd)、铅 (Pb) 和砷 (As) 的生物可及性,并利用反距离加权 (IDW) 插值法构建了中国综合空间分布图。在此基础上,我们进一步得出了中国冶金场地土壤环境标准。我们从收集的数据中观察到,采矿/冶炼场地中镉、铅和砷超标的场地数量分别从 5 个、58 个和 14 个减少到 1 个、24 个和 7 个。这项研究为在大陆范围内进行跨区域风险评估提供了精确而科学的方法,为土壤环境管理奠定了坚实的基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Predicting the Bioaccessibility of Soil Cd, Pb, and As with Advanced Machine Learning for Continental-Scale Soil Environmental Criteria Determination in China

Predicting the Bioaccessibility of Soil Cd, Pb, and As with Advanced Machine Learning for Continental-Scale Soil Environmental Criteria Determination in China

Investigating the bioaccessibility of harmful inorganic elements in soil is crucial for understanding their behavior in the environment and accurately assessing the environmental risks associated with soil. Traditional batch experimental methods and linear models, however, are time-consuming and often fall short in precisely quantifying bioaccessibility. In this study, using 937 data points gathered from 56 journal articles, we developed machine learning models for three harmful inorganic elements, namely, Cd, Pb, and As. After thorough analysis, the model optimized through a boosting ensemble strategy demonstrated the best performance, with an average R2 of 0.95 and an RMSE of 0.25. We further employed SHAP values in conjunction with quantitative analysis to identify the key features that influence bioaccessibility. By utilizing the developed integrated models, we carried out predictions for 3002 data points across China, clarifying the bioaccessibility of cadmium (Cd), lead (Pb), and arsenic (As) in the soils of various sites and constructed a comprehensive spatial distribution map of China using the inverse distance weighting (IDW) interpolation method. Based on these findings, we further derived the soil environmental standards for metallurgical sites in China. Our observations from the collected data indicate a reduction in the number of sites exceeding the standard levels for Cd, Pb, and As in mining/smelting sites from 5, 58, and 14 to 1, 24, and 7, respectively. This research offers a precise and scientific approach for cross-regional risk assessment at the continental scale and lays a solid foundation for soil environmental management.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environment & Health
Environment & Health 环境科学、健康科学-
自引率
0.00%
发文量
0
期刊介绍: Environment & Health a peer-reviewed open access journal is committed to exploring the relationship between the environment and human health.As a premier journal for multidisciplinary research Environment & Health reports the health consequences for individuals and communities of changing and hazardous environmental factors. In supporting the UN Sustainable Development Goals the journal aims to help formulate policies to create a healthier world.Topics of interest include but are not limited to:Air water and soil pollutionExposomicsEnvironmental epidemiologyInnovative analytical methodology and instrumentation (multi-omics non-target analysis effect-directed analysis high-throughput screening etc.)Environmental toxicology (endocrine disrupting effect neurotoxicity alternative toxicology computational toxicology epigenetic toxicology etc.)Environmental microbiology pathogen and environmental transmission mechanisms of diseasesEnvironmental modeling bioinformatics and artificial intelligenceEmerging contaminants (including plastics engineered nanomaterials etc.)Climate change and related health effectHealth impacts of energy evolution and carbon neutralizationFood and drinking water safetyOccupational exposure and medicineInnovations in environmental technologies for better healthPolicies and international relations concerned with environmental health
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信