Ren-Jie Zhang , Shu-Fang Pan , Huai-Zeng Xing , Tian-Hao Dong , Sai-Hua Liu , Tao Xue , Fa-Xiang Tian , Hong-Yu Fu , Yun-He Xie , Xiong-Hui Ji
{"title":"Multigraph fusion neural network for predicting available cadmium levels in farmland soils","authors":"Ren-Jie Zhang , Shu-Fang Pan , Huai-Zeng Xing , Tian-Hao Dong , Sai-Hua Liu , Tao Xue , Fa-Xiang Tian , Hong-Yu Fu , Yun-He Xie , Xiong-Hui Ji","doi":"10.1016/j.envpol.2025.127225","DOIUrl":null,"url":null,"abstract":"<div><div>Cadmium (Cd), a primary heavy metal pollutant in farmland soils, poses a significant threat to soil ecosystems and human health, developing accurate machine learning models to predict soil available Cd concentration is crucial for formulating effective remediation strategies. However, most existing models rely on the simple aggregation of multiple environmental and geographical variables to predict soil available Cd concentration, often neglecting the complex interrelationships among these variables and the spatial effects of geographic factors. In this work, a novel multigraph fusion neural network model based on the spatial relationships between sampling points and various geographic factors (elevation, mine, roads, and rivers) is proposed. By integrating multiple spatial relationship graphs, the model effectively captures the spatial effects of geographic factors on the farmland soil environment. The results demonstrate that the multigraph fusion model significantly outperforms the other models in predicting soil available Cd concentration, achieving R<sup>2</sup> value of 0.82, RMSE value of 0.0338 mg kg<sup>−1</sup>, and MAE value of 0.0249 mg kg<sup>−1</sup>. Compared with the single-graph models and baseline models, the multigraph fusion model provides lower residual distribution range and more stable prediction performance. These findings validate the feasibility of incorporating the spatial effects of geographic factors to increase the prediction performance of soil available Cd concentration models and offer valuable analysis tools into the environmental drivers underlying the spatial heterogeneity in heavy metal contamination in farmland soils.</div></div>","PeriodicalId":311,"journal":{"name":"Environmental Pollution","volume":"386 ","pages":"Article 127225"},"PeriodicalIF":7.3000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Pollution","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0269749125015994","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Cadmium (Cd), a primary heavy metal pollutant in farmland soils, poses a significant threat to soil ecosystems and human health, developing accurate machine learning models to predict soil available Cd concentration is crucial for formulating effective remediation strategies. However, most existing models rely on the simple aggregation of multiple environmental and geographical variables to predict soil available Cd concentration, often neglecting the complex interrelationships among these variables and the spatial effects of geographic factors. In this work, a novel multigraph fusion neural network model based on the spatial relationships between sampling points and various geographic factors (elevation, mine, roads, and rivers) is proposed. By integrating multiple spatial relationship graphs, the model effectively captures the spatial effects of geographic factors on the farmland soil environment. The results demonstrate that the multigraph fusion model significantly outperforms the other models in predicting soil available Cd concentration, achieving R2 value of 0.82, RMSE value of 0.0338 mg kg−1, and MAE value of 0.0249 mg kg−1. Compared with the single-graph models and baseline models, the multigraph fusion model provides lower residual distribution range and more stable prediction performance. These findings validate the feasibility of incorporating the spatial effects of geographic factors to increase the prediction performance of soil available Cd concentration models and offer valuable analysis tools into the environmental drivers underlying the spatial heterogeneity in heavy metal contamination in farmland soils.
期刊介绍:
Environmental Pollution is an international peer-reviewed journal that publishes high-quality research papers and review articles covering all aspects of environmental pollution and its impacts on ecosystems and human health.
Subject areas include, but are not limited to:
• Sources and occurrences of pollutants that are clearly defined and measured in environmental compartments, food and food-related items, and human bodies;
• Interlinks between contaminant exposure and biological, ecological, and human health effects, including those of climate change;
• Contaminants of emerging concerns (including but not limited to antibiotic resistant microorganisms or genes, microplastics/nanoplastics, electronic wastes, light, and noise) and/or their biological, ecological, or human health effects;
• Laboratory and field studies on the remediation/mitigation of environmental pollution via new techniques and with clear links to biological, ecological, or human health effects;
• Modeling of pollution processes, patterns, or trends that is of clear environmental and/or human health interest;
• New techniques that measure and examine environmental occurrences, transport, behavior, and effects of pollutants within the environment or the laboratory, provided that they can be clearly used to address problems within regional or global environmental compartments.