{"title":"土壤镉浓度空间预测:一种具有新评价指标的多模型预测系统","authors":"Ziqian Zhong, Qihong Zhu, Xinliang Liu, Shufang Pan, Lei Luo, Hanhua Zhu, Rui Liu, Daoyou Huang","doi":"10.1016/j.jhazmat.2025.139952","DOIUrl":null,"url":null,"abstract":"This study proposes a modular, automated Multi-Model Prediction System (MMPS) to overcome nonlinear relationships, data integration difficulties, and model instability in soil cadmium (Cd) spatial prediction under small sample conditions. We further introduce two novel evaluation metrics — Global Performance Metrics (GPM) to assess robustness across runs, and Global Mean Local Standard Deviation (GMLSD) to quantify spatial stability of prediction maps. Our system achieved high predictive accuracy, with the LGBM model demonstrating superior performance (Global R² = 0.614). Crucially, GPM yielded substantially more consistent evaluations, exhibiting up to an 88.6% reduction in performance standard deviation across random seeds compared to traditional metrics. The MMPS, which integrates geostatistical, deterministic, and machine learning algorithms, was tested in a small mining watershed in Hunan, China. Using multi-source data (remote sensing, topography, soil properties) from 149 samples, we compared 12 models. Results showed that machine learning models achieved higher prediction accuracy but tended to have lower spatial prediction stability (higher GMLSD). Topographic data was the most influential input, boosting performance by 53.5%, whereas spectral or soil data alone showed limited impact. Key driving factors were identified as distance to river, soil pH, B2 band, and organic matter. The MMPS provides a high-precision, low-cost intelligent tool for pollution management. Together with our robust evaluation framework, this research offers a new paradigm for environmental spatial prediction, enabling smarter, scalable, and more reliable pollution control strategies.","PeriodicalId":361,"journal":{"name":"Journal of Hazardous Materials","volume":"41 1","pages":""},"PeriodicalIF":11.3000,"publicationDate":"2025-09-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Spatial prediction of soil cadmium concentration: a multi-model prediction system with novel evaluation metrics\",\"authors\":\"Ziqian Zhong, Qihong Zhu, Xinliang Liu, Shufang Pan, Lei Luo, Hanhua Zhu, Rui Liu, Daoyou Huang\",\"doi\":\"10.1016/j.jhazmat.2025.139952\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This study proposes a modular, automated Multi-Model Prediction System (MMPS) to overcome nonlinear relationships, data integration difficulties, and model instability in soil cadmium (Cd) spatial prediction under small sample conditions. We further introduce two novel evaluation metrics — Global Performance Metrics (GPM) to assess robustness across runs, and Global Mean Local Standard Deviation (GMLSD) to quantify spatial stability of prediction maps. Our system achieved high predictive accuracy, with the LGBM model demonstrating superior performance (Global R² = 0.614). Crucially, GPM yielded substantially more consistent evaluations, exhibiting up to an 88.6% reduction in performance standard deviation across random seeds compared to traditional metrics. The MMPS, which integrates geostatistical, deterministic, and machine learning algorithms, was tested in a small mining watershed in Hunan, China. Using multi-source data (remote sensing, topography, soil properties) from 149 samples, we compared 12 models. Results showed that machine learning models achieved higher prediction accuracy but tended to have lower spatial prediction stability (higher GMLSD). Topographic data was the most influential input, boosting performance by 53.5%, whereas spectral or soil data alone showed limited impact. Key driving factors were identified as distance to river, soil pH, B2 band, and organic matter. The MMPS provides a high-precision, low-cost intelligent tool for pollution management. Together with our robust evaluation framework, this research offers a new paradigm for environmental spatial prediction, enabling smarter, scalable, and more reliable pollution control strategies.\",\"PeriodicalId\":361,\"journal\":{\"name\":\"Journal of Hazardous Materials\",\"volume\":\"41 1\",\"pages\":\"\"},\"PeriodicalIF\":11.3000,\"publicationDate\":\"2025-09-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Hazardous Materials\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jhazmat.2025.139952\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Hazardous Materials","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1016/j.jhazmat.2025.139952","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
Spatial prediction of soil cadmium concentration: a multi-model prediction system with novel evaluation metrics
This study proposes a modular, automated Multi-Model Prediction System (MMPS) to overcome nonlinear relationships, data integration difficulties, and model instability in soil cadmium (Cd) spatial prediction under small sample conditions. We further introduce two novel evaluation metrics — Global Performance Metrics (GPM) to assess robustness across runs, and Global Mean Local Standard Deviation (GMLSD) to quantify spatial stability of prediction maps. Our system achieved high predictive accuracy, with the LGBM model demonstrating superior performance (Global R² = 0.614). Crucially, GPM yielded substantially more consistent evaluations, exhibiting up to an 88.6% reduction in performance standard deviation across random seeds compared to traditional metrics. The MMPS, which integrates geostatistical, deterministic, and machine learning algorithms, was tested in a small mining watershed in Hunan, China. Using multi-source data (remote sensing, topography, soil properties) from 149 samples, we compared 12 models. Results showed that machine learning models achieved higher prediction accuracy but tended to have lower spatial prediction stability (higher GMLSD). Topographic data was the most influential input, boosting performance by 53.5%, whereas spectral or soil data alone showed limited impact. Key driving factors were identified as distance to river, soil pH, B2 band, and organic matter. The MMPS provides a high-precision, low-cost intelligent tool for pollution management. Together with our robust evaluation framework, this research offers a new paradigm for environmental spatial prediction, enabling smarter, scalable, and more reliable pollution control strategies.
期刊介绍:
The Journal of Hazardous Materials serves as a global platform for promoting cutting-edge research in the field of Environmental Science and Engineering. Our publication features a wide range of articles, including full-length research papers, review articles, and perspectives, with the aim of enhancing our understanding of the dangers and risks associated with various materials concerning public health and the environment. It is important to note that the term "environmental contaminants" refers specifically to substances that pose hazardous effects through contamination, while excluding those that do not have such impacts on the environment or human health. Moreover, we emphasize the distinction between wastes and hazardous materials in order to provide further clarity on the scope of the journal. We have a keen interest in exploring specific compounds and microbial agents that have adverse effects on the environment.