Spatial prediction of soil cadmium concentration: a multi-model prediction system with novel evaluation metrics

IF 11.3 1区 环境科学与生态学 Q1 ENGINEERING, ENVIRONMENTAL
Ziqian Zhong, Qihong Zhu, Xinliang Liu, Shufang Pan, Lei Luo, Hanhua Zhu, Rui Liu, Daoyou Huang
{"title":"Spatial prediction of soil cadmium concentration: a multi-model prediction system with novel evaluation metrics","authors":"Ziqian Zhong, Qihong Zhu, Xinliang Liu, Shufang Pan, Lei Luo, Hanhua Zhu, Rui Liu, Daoyou Huang","doi":"10.1016/j.jhazmat.2025.139952","DOIUrl":null,"url":null,"abstract":"This study proposes a modular, automated Multi-Model Prediction System (MMPS) to overcome nonlinear relationships, data integration difficulties, and model instability in soil cadmium (Cd) spatial prediction under small sample conditions. We further introduce two novel evaluation metrics — Global Performance Metrics (GPM) to assess robustness across runs, and Global Mean Local Standard Deviation (GMLSD) to quantify spatial stability of prediction maps. Our system achieved high predictive accuracy, with the LGBM model demonstrating superior performance (Global R² = 0.614). Crucially, GPM yielded substantially more consistent evaluations, exhibiting up to an 88.6% reduction in performance standard deviation across random seeds compared to traditional metrics. The MMPS, which integrates geostatistical, deterministic, and machine learning algorithms, was tested in a small mining watershed in Hunan, China. Using multi-source data (remote sensing, topography, soil properties) from 149 samples, we compared 12 models. Results showed that machine learning models achieved higher prediction accuracy but tended to have lower spatial prediction stability (higher GMLSD). Topographic data was the most influential input, boosting performance by 53.5%, whereas spectral or soil data alone showed limited impact. Key driving factors were identified as distance to river, soil pH, B2 band, and organic matter. The MMPS provides a high-precision, low-cost intelligent tool for pollution management. Together with our robust evaluation framework, this research offers a new paradigm for environmental spatial prediction, enabling smarter, scalable, and more reliable pollution control strategies.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"41 1","pages":""},"PeriodicalIF":11.3000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.139952","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes a modular, automated Multi-Model Prediction System (MMPS) to overcome nonlinear relationships, data integration difficulties, and model instability in soil cadmium (Cd) spatial prediction under small sample conditions. We further introduce two novel evaluation metrics — Global Performance Metrics (GPM) to assess robustness across runs, and Global Mean Local Standard Deviation (GMLSD) to quantify spatial stability of prediction maps. Our system achieved high predictive accuracy, with the LGBM model demonstrating superior performance (Global R² = 0.614). Crucially, GPM yielded substantially more consistent evaluations, exhibiting up to an 88.6% reduction in performance standard deviation across random seeds compared to traditional metrics. The MMPS, which integrates geostatistical, deterministic, and machine learning algorithms, was tested in a small mining watershed in Hunan, China. Using multi-source data (remote sensing, topography, soil properties) from 149 samples, we compared 12 models. Results showed that machine learning models achieved higher prediction accuracy but tended to have lower spatial prediction stability (higher GMLSD). Topographic data was the most influential input, boosting performance by 53.5%, whereas spectral or soil data alone showed limited impact. Key driving factors were identified as distance to river, soil pH, B2 band, and organic matter. The MMPS provides a high-precision, low-cost intelligent tool for pollution management. Together with our robust evaluation framework, this research offers a new paradigm for environmental spatial prediction, enabling smarter, scalable, and more reliable pollution control strategies.

Abstract Image

土壤镉浓度空间预测:一种具有新评价指标的多模型预测系统
本文提出了一种模块化、自动化的多模型预测系统(MMPS),以克服小样本条件下土壤镉(Cd)空间预测中的非线性关系、数据集成困难和模型不稳定性。我们进一步引入了两种新的评估指标——用于评估跨运行稳健性的全局性能指标(GPM)和量化预测图空间稳定性的全局平均局部标准偏差(GMLSD)。该系统具有较高的预测精度,其中LGBM模型表现出较好的性能(全局R²= 0.614)。至关重要的是,GPM产生了更加一致的评估,与传统指标相比,随机种子的性能标准差降低了88.6%。MMPS集成了地质统计学、确定性和机器学习算法,在中国湖南的一个小矿区流域进行了测试。利用来自149个样本的多源数据(遥感、地形、土壤性质),我们比较了12种模型。结果表明,机器学习模型的预测精度较高,但空间预测稳定性较低(GMLSD较高)。地形数据是最具影响力的输入,将性能提高了53.5%,而光谱或土壤数据本身的影响有限。与河流的距离、土壤pH值、B2带和有机质为主要驱动因子。MMPS为污染管理提供了高精度、低成本的智能工具。结合我们强大的评估框架,该研究为环境空间预测提供了一个新的范例,从而实现更智能、可扩展和更可靠的污染控制策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Hazardous Materials
Journal of Hazardous Materials 工程技术-工程:环境
CiteScore
25.40
自引率
5.90%
发文量
3059
审稿时长
58 days
期刊介绍: The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信