聚类方法在数字土壤制图中的应用——以土壤质地区划为例

4区 农林科学 Q2 Agricultural and Biological Sciences
Soil Science Pub Date : 2021-04-28 DOI:10.5194/SOIL-2020-102
I. Dunkl, Mareike Ließ
{"title":"聚类方法在数字土壤制图中的应用——以土壤质地区划为例","authors":"I. Dunkl, Mareike Ließ","doi":"10.5194/SOIL-2020-102","DOIUrl":null,"url":null,"abstract":"Abstract. High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.\n","PeriodicalId":22015,"journal":{"name":"Soil Science","volume":"73 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2021-04-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"On the benefits of clustering approaches in digital soil mapping: an application example concerning soil texture regionalization\",\"authors\":\"I. Dunkl, Mareike Ließ\",\"doi\":\"10.5194/SOIL-2020-102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Abstract. High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.\\n\",\"PeriodicalId\":22015,\"journal\":{\"name\":\"Soil Science\",\"volume\":\"73 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-04-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Soil Science\",\"FirstCategoryId\":\"97\",\"ListUrlMain\":\"https://doi.org/10.5194/SOIL-2020-102\",\"RegionNum\":4,\"RegionCategory\":\"农林科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Agricultural and Biological Sciences\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil Science","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.5194/SOIL-2020-102","RegionNum":4,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Agricultural and Biological Sciences","Score":null,"Total":0}
引用次数: 1

摘要

摘要高分辨率土壤地图是土地管理者和研究人员迫切需要的各种应用。数字土壤制图(DSM)允许在经验模型的帮助下将土壤属性与环境协变量联系起来,从而对土壤属性进行区域化。在本研究中,使用遗留土壤数据集来训练机器学习算法,以预测萨克森-安哈尔特州(德国)博德河流域的粒度分布。基于数字高程模型、土地覆盖数据和地质图的环境协变量,采用集成学习方法随机森林对土壤质地进行预测。我们研究了聚类应用程序在解决DSM程序的各个方面的有用性。为了探讨数据不平衡问题在学习过程中的作用,采用环境变量对研究区景观进行聚类。使用不同的采样策略来创建平衡的训练数据,并评估其提高模型性能的能力。聚类应用还涉及特征选择和分层交叉验证。总的来说,集群应用程序似乎是一个通用的工具,可以在DSM过程的各个步骤中使用。除了它们的成功应用之外,还确定了DSM的进一步应用领域。其中之一是找到适当的方法来包含专业知识。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
On the benefits of clustering approaches in digital soil mapping: an application example concerning soil texture regionalization
Abstract. High resolution soil maps are urgently needed by land managers and researchers for a variety of applications. Digital Soil Mapping (DSM) allows to regionalize soil properties by relating them to environmental covariates with the help of an empirical model. In this study, a legacy soil data set was used to train a machine learning algorithm in order to predict the particle size distribution within the catchment of the Bode river in Saxony-Anhalt (Germany). The ensemble learning method random forest was used to predict soil texture based on environmental covariates originating from a digital elevation model, land cover data and geologic maps. We studied the usefulness of clustering applications in addressing various aspects of the DSM procedure. To investigate the role of the imbalanced data problem in the learning process, the environmental variables were used to cluster the landscape of the study area. Different sampling strategies were used to create balanced training data and were evaluated on their ability to improve model performance. Clustering applications were also involved in feature selection and stratified cross-validation. Overall, clustering applications appear to be a versatile tool to be employed at various steps of the DSM procedure. Beyond their successful application, further application fields in DSM were identified. One of them is to find adequate means to include expert knowledge.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Soil Science
Soil Science 农林科学-土壤科学
CiteScore
2.70
自引率
0.00%
发文量
0
审稿时长
4.4 months
期刊介绍: Cessation.Soil Science satisfies the professional needs of all scientists and laboratory personnel involved in soil and plant research by publishing primary research reports and critical reviews of basic and applied soil science, especially as it relates to soil and plant studies and general environmental soil science. Each month, Soil Science presents authoritative research articles from an impressive array of discipline: soil chemistry and biochemistry, physics, fertility and nutrition, soil genesis and morphology, soil microbiology and mineralogy. Of immediate relevance to soil scientists-both industrial and academic-this unique publication also has long-range value for agronomists and environmental scientists.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信