A novel graph convolutional neural network model for predicting soil Cd and As pollution: Identification of influencing factors and interpretability

IF 6.2 2区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Ren-Jie Zhang , Xiong-Hui Ji , Yun-He Xie , Tao Xue , Sai-Hua Liu , Fa-Xiang Tian , Shu-Fang Pan
{"title":"A novel graph convolutional neural network model for predicting soil Cd and As pollution: Identification of influencing factors and interpretability","authors":"Ren-Jie Zhang ,&nbsp;Xiong-Hui Ji ,&nbsp;Yun-He Xie ,&nbsp;Tao Xue ,&nbsp;Sai-Hua Liu ,&nbsp;Fa-Xiang Tian ,&nbsp;Shu-Fang Pan","doi":"10.1016/j.ecoenv.2025.117926","DOIUrl":null,"url":null,"abstract":"<div><div>Soil pollution caused by toxic metals poses serious threats to the ecological environment and human well-being. Accurately predicting toxic metal concentrations is critical for safeguarding soil environmental security. However, the distribution of soil toxic metal concentrations often exhibits significant spatial heterogeneity and intricate correlations with other environmental influencing factors, posing substantial challenges to accurate prediction. This study delves into the prospective application of a novel graph convolutional neural network model, namely DistNet-GCN. By capitalizing on the spatial relationships among sampling points, this model endeavors to predict cadmium (Cd) and arsenic (As) concentrations in soil. The distinctive feature of this model resides in its capacity to mimic the transmission process of relationships between soil Cd/As concentrations and the environmental influencing factors within a local spatial scope by integrating the powerful ability of GCN to extract the inter-node dependencies in complex networks. Subsequently, it extracts the critical features of the dataset from a spatial relationship graph structure by taking the spatial positions of sampling points as network nodes, the concentrations of toxic metals as node labels, and environmental factors as node attributes. In comparison with traditional models, the DistNet-GCN model achieves the highest prediction accuracy for soil Cd and As concentrations. Specifically, the R<sup>2</sup> values reach 0.91 and 0.94 respectively, which signify improvements of 21.33 % and 9.30 % over those of Multiple Linear Regression (MLR). The outcome of the interpretability analysis shows that the urban human activities, mining operation, pH, and soil organic matter (SOM) are the most important environmental factors affecting the spatial distribution of soil Cd/As concentrations in the study area. Additionally, the local spatial autocorrelation findings reveal that the Moran’s I values for Cd and As are 0.796 and 0.897, respectively, which validate the structural soundness and rationality of the DistNet-GCN model. This study enlightens a novel approach of soil Cd/As concentrations prediction by integrating spatial graph structures into the deep learning models and is significant for uncovering the complex correlations between toxic metal concentrations in soil and various environmental factors.</div></div>","PeriodicalId":303,"journal":{"name":"Ecotoxicology and Environmental Safety","volume":"292 ","pages":"Article 117926"},"PeriodicalIF":6.2000,"publicationDate":"2025-02-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ecotoxicology and Environmental Safety","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0147651325002623","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Soil pollution caused by toxic metals poses serious threats to the ecological environment and human well-being. Accurately predicting toxic metal concentrations is critical for safeguarding soil environmental security. However, the distribution of soil toxic metal concentrations often exhibits significant spatial heterogeneity and intricate correlations with other environmental influencing factors, posing substantial challenges to accurate prediction. This study delves into the prospective application of a novel graph convolutional neural network model, namely DistNet-GCN. By capitalizing on the spatial relationships among sampling points, this model endeavors to predict cadmium (Cd) and arsenic (As) concentrations in soil. The distinctive feature of this model resides in its capacity to mimic the transmission process of relationships between soil Cd/As concentrations and the environmental influencing factors within a local spatial scope by integrating the powerful ability of GCN to extract the inter-node dependencies in complex networks. Subsequently, it extracts the critical features of the dataset from a spatial relationship graph structure by taking the spatial positions of sampling points as network nodes, the concentrations of toxic metals as node labels, and environmental factors as node attributes. In comparison with traditional models, the DistNet-GCN model achieves the highest prediction accuracy for soil Cd and As concentrations. Specifically, the R2 values reach 0.91 and 0.94 respectively, which signify improvements of 21.33 % and 9.30 % over those of Multiple Linear Regression (MLR). The outcome of the interpretability analysis shows that the urban human activities, mining operation, pH, and soil organic matter (SOM) are the most important environmental factors affecting the spatial distribution of soil Cd/As concentrations in the study area. Additionally, the local spatial autocorrelation findings reveal that the Moran’s I values for Cd and As are 0.796 and 0.897, respectively, which validate the structural soundness and rationality of the DistNet-GCN model. This study enlightens a novel approach of soil Cd/As concentrations prediction by integrating spatial graph structures into the deep learning models and is significant for uncovering the complex correlations between toxic metal concentrations in soil and various environmental factors.
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
12.10
自引率
5.90%
发文量
1234
审稿时长
88 days
期刊介绍: Ecotoxicology and Environmental Safety is a multi-disciplinary journal that focuses on understanding the exposure and effects of environmental contamination on organisms including human health. The scope of the journal covers three main themes. The topics within these themes, indicated below, include (but are not limited to) the following: Ecotoxicology、Environmental Chemistry、Environmental Safety etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信