利用机器学习绘制郑州耕地表层土壤有机碳密度图。

IF 3.2 3区 环境科学与生态学 Q3 ENGINEERING, ENVIRONMENTAL
Hengliang Guo, Jinyang Wang, Dujuan Zhang, Jian Cui, Yonghao Yuan, Haoming Bao, Mengjiao Yang, Jiahui Guo, Feng Chen, Wenge Zhou, Gang Wu, Yang Guo, Haitao Wei, Baojin Qiao, Shan Zhao
{"title":"利用机器学习绘制郑州耕地表层土壤有机碳密度图。","authors":"Hengliang Guo, Jinyang Wang, Dujuan Zhang, Jian Cui, Yonghao Yuan, Haoming Bao, Mengjiao Yang, Jiahui Guo, Feng Chen, Wenge Zhou, Gang Wu, Yang Guo, Haitao Wei, Baojin Qiao, Shan Zhao","doi":"10.1007/s10653-024-02313-8","DOIUrl":null,"url":null,"abstract":"<p><p>Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the \"double-carbon\" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m<sup>2</sup>, with test set precisions (R<sup>2</sup>) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R<sup>2</sup> = 0.52, MSE = 0.32) over the SVM model (R<sup>2</sup> = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 10<sup>6</sup>t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.</p>","PeriodicalId":11759,"journal":{"name":"Environmental Geochemistry and Health","volume":"47 1","pages":"1"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11604695/pdf/","citationCount":"0","resultStr":"{\"title\":\"Mapping surface soil organic carbon density of cultivated land using machine learning in Zhengzhou.\",\"authors\":\"Hengliang Guo, Jinyang Wang, Dujuan Zhang, Jian Cui, Yonghao Yuan, Haoming Bao, Mengjiao Yang, Jiahui Guo, Feng Chen, Wenge Zhou, Gang Wu, Yang Guo, Haitao Wei, Baojin Qiao, Shan Zhao\",\"doi\":\"10.1007/s10653-024-02313-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the \\\"double-carbon\\\" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m<sup>2</sup>, with test set precisions (R<sup>2</sup>) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R<sup>2</sup> = 0.52, MSE = 0.32) over the SVM model (R<sup>2</sup> = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 10<sup>6</sup>t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.</p>\",\"PeriodicalId\":11759,\"journal\":{\"name\":\"Environmental Geochemistry and Health\",\"volume\":\"47 1\",\"pages\":\"1\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11604695/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Geochemistry and Health\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://doi.org/10.1007/s10653-024-02313-8\",\"RegionNum\":3,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"ENGINEERING, ENVIRONMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Geochemistry and Health","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s10653-024-02313-8","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0

摘要

土壤有机碳(SOC)研究对于提高土壤碳汇、实现 "双碳 "目标至关重要。本研究以郑州市 2021 年土地质量调查和多目标区域地球化学调查数据为基础,引入了 10 个辅助变量。该研究利用地质统计普通克里金(OK)插值法以及经典的机器学习(ML)模型,包括随机森林(RF)和支持向量机(SVM),绘制了耕地表土层(0 - 20 cm)的土壤有机碳密度(SOCD)图。为了评估机器学习模型的泛化能力,该研究对采样数据进行了分区,将中牟县作为独立测试集(数据集 2),其余数据作为训练集(数据集 1)。使用数据集 1 对三个模型进行训练,然后将训练好的机器学习模型直接应用于数据集 2,以评估和比较它们的泛化性能。利用最优插值法分析了 SOCD 和 SOCS 在不同类型和质地土壤中的分布。结果表明(1) OK 插值法、RF 和 SVM 预测的平均 SOC 密度分别为 3.70、3.74 和 3.63 kg/m2,测试集精度(R2)分别为 0.34、0.60 和 0.81。(2) ML 的预测精度明显高于传统的 OK 插值法。RF 模型的精度比 SVM 模型高 0.21,在估算碳储量方面更为精确。(3) 应用于数据集2 时,RF 模型的泛化能力(R2 = 0.52,MSE = 0.32)优于 SVM 模型(R2 = 0.32,MSE = 0.45)。(4) 研究区地表 SOCD 的空间分布呈现由西向东、由南向北递减的梯度。(5) 将土壤属性变量、气候变量、遥感数据和机器学习技术相结合,有望实现高精度、高质量的农田土壤有机碳密度(SOCD)测绘。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Mapping surface soil organic carbon density of cultivated land using machine learning in Zhengzhou.

Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the "double-carbon" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m2, with test set precisions (R2) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R2 = 0.52, MSE = 0.32) over the SVM model (R2 = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 106t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Environmental Geochemistry and Health
Environmental Geochemistry and Health 环境科学-工程:环境
CiteScore
8.00
自引率
4.80%
发文量
279
审稿时长
4.2 months
期刊介绍: Environmental Geochemistry and Health publishes original research papers and review papers across the broad field of environmental geochemistry. Environmental geochemistry and health establishes and explains links between the natural or disturbed chemical composition of the earth’s surface and the health of plants, animals and people. Beneficial elements regulate or promote enzymatic and hormonal activity whereas other elements may be toxic. Bedrock geochemistry controls the composition of soil and hence that of water and vegetation. Environmental issues, such as pollution, arising from the extraction and use of mineral resources, are discussed. The effects of contaminants introduced into the earth’s geochemical systems are examined. Geochemical surveys of soil, water and plants show how major and trace elements are distributed geographically. Associated epidemiological studies reveal the possibility of causal links between the natural or disturbed geochemical environment and disease. Experimental research illuminates the nature or consequences of natural or disturbed geochemical processes. The journal particularly welcomes novel research linking environmental geochemistry and health issues on such topics as: heavy metals (including mercury), persistent organic pollutants (POPs), and mixed chemicals emitted through human activities, such as uncontrolled recycling of electronic-waste; waste recycling; surface-atmospheric interaction processes (natural and anthropogenic emissions, vertical transport, deposition, and physical-chemical interaction) of gases and aerosols; phytoremediation/restoration of contaminated sites; food contamination and safety; environmental effects of medicines; effects and toxicity of mixed pollutants; speciation of heavy metals/metalloids; effects of mining; disturbed geochemistry from human behavior, natural or man-made hazards; particle and nanoparticle toxicology; risk and the vulnerability of populations, etc.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信