[Source Apportionment of Heavy Metals in Soils Based on Machine Learning Algorithms and Receptor Model].

Q2 Environmental Science
Jie Ma, Ming-Sheng Li, Xue Feng
{"title":"[Source Apportionment of Heavy Metals in Soils Based on Machine Learning Algorithms and Receptor Model].","authors":"Jie Ma, Ming-Sheng Li, Xue Feng","doi":"10.13227/j.hjkx.202408149","DOIUrl":null,"url":null,"abstract":"<p><p>To analyze the source apportionment and influence factors of heavy metals in soils surrounding a coal gangue heap in Chongqing, three machine learning algorithms (decision tree (DT), random forest (RF), and support vector machine (SVM)) and the absolute principal component scores-multiple linear regression (APCS-MLR) receptor model were used. The surface soil results showed that the average values of Cd, Hg, As, Pb, Cr, Cu, Ni, and Zn were 0.44, 0.18, 9.92, 32.3, 129, 100, 72.8, and 148 mg·kg<sup>-1</sup>. Combined profile soil data showed that Cd, Hg, As, Pb, Cr, Cu, Ni, and Zn were affected by human activities to varying degrees. Using machine learning algorithms analysis, RF was better than DT and SVM, and <i>R</i><sup>2</sup> values of Cd, Hg, As, Pb, Cr, Cu, Ni, and Zn were 0.783, 0.728, 0.528, 0.753, 0.753, 0.853, 0.822, and 0.756. \"The number of coal gangue units\" (<i>X</i><sub>1</sub>), \"the vertical height difference between the sampling point and coal gangue heap\" (<i>X</i><sub>2</sub>), and \"the distance between the sampling point and the coal gangue heap\" (<i>X</i><sub>3</sub>) were the key driving factors by human activities. Combined with APCS-MLR model analysis, the soil in the study area was affected by natural sources, mining sources, and mixed sources (including atmospheric deposition, agricultural production, life and traffic emissions, etc.), with contribution rates of 42.5%, 37.1%, and 20.4%, respectively. The combined application of the machine learning algorithms and receptor model can make the results of source apportionment more comprehensive, accurate, and reliable.</p>","PeriodicalId":35937,"journal":{"name":"环境科学","volume":"46 8","pages":"5229-5236"},"PeriodicalIF":0.0000,"publicationDate":"2025-08-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学","FirstCategoryId":"1087","ListUrlMain":"https://doi.org/10.13227/j.hjkx.202408149","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Environmental Science","Score":null,"Total":0}
引用次数: 0

Abstract

To analyze the source apportionment and influence factors of heavy metals in soils surrounding a coal gangue heap in Chongqing, three machine learning algorithms (decision tree (DT), random forest (RF), and support vector machine (SVM)) and the absolute principal component scores-multiple linear regression (APCS-MLR) receptor model were used. The surface soil results showed that the average values of Cd, Hg, As, Pb, Cr, Cu, Ni, and Zn were 0.44, 0.18, 9.92, 32.3, 129, 100, 72.8, and 148 mg·kg-1. Combined profile soil data showed that Cd, Hg, As, Pb, Cr, Cu, Ni, and Zn were affected by human activities to varying degrees. Using machine learning algorithms analysis, RF was better than DT and SVM, and R2 values of Cd, Hg, As, Pb, Cr, Cu, Ni, and Zn were 0.783, 0.728, 0.528, 0.753, 0.753, 0.853, 0.822, and 0.756. "The number of coal gangue units" (X1), "the vertical height difference between the sampling point and coal gangue heap" (X2), and "the distance between the sampling point and the coal gangue heap" (X3) were the key driving factors by human activities. Combined with APCS-MLR model analysis, the soil in the study area was affected by natural sources, mining sources, and mixed sources (including atmospheric deposition, agricultural production, life and traffic emissions, etc.), with contribution rates of 42.5%, 37.1%, and 20.4%, respectively. The combined application of the machine learning algorithms and receptor model can make the results of source apportionment more comprehensive, accurate, and reliable.

基于机器学习算法和受体模型的土壤重金属源解析[j]。
采用决策树(DT)、随机森林(RF)和支持向量机(SVM) 3种机器学习算法和绝对主成分评分-多元线性回归(APCS-MLR)受体模型,对重庆某煤矸石堆周边土壤重金属的来源分摊及影响因素进行了分析。表层土壤Cd、Hg、As、Pb、Cr、Cu、Ni、Zn的平均值分别为0.44、0.18、9.92、32.3、129、100、72.8和148 mg·kg-1。土壤Cd、Hg、As、Pb、Cr、Cu、Ni、Zn受到人类活动不同程度的影响。通过机器学习算法分析,RF优于DT和SVM, Cd、Hg、As、Pb、Cr、Cu、Ni、Zn的R2值分别为0.783、0.728、0.528、0.753、0.753、0.853、0.822、0.756。“煤矸石单位数”(X1)、“采样点与煤矸石堆的垂直高度差”(X2)、“采样点与煤矸石堆的距离”(X3)是人类活动的关键驱动因素。结合APCS-MLR模型分析,研究区土壤受自然源、采矿源和混合源(包括大气沉降、农业生产、生活和交通排放等)的影响,贡献率分别为42.5%、37.1%和20.4%。机器学习算法与受体模型的结合应用,可以使源分配结果更加全面、准确、可靠。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
环境科学
环境科学 Environmental Science-Environmental Science (all)
CiteScore
4.40
自引率
0.00%
发文量
15329
期刊介绍:
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信