An Evaluation of Some Machine Learning Algorithms as Tools for Predicting Soil Characteristics Based on Their Spectral Response in the Vis‑NIR Range

Q3 Social Sciences
S. Gruszczyński
{"title":"An Evaluation of Some Machine Learning Algorithms as Tools for Predicting Soil Characteristics Based on Their Spectral Response in the Vis‑NIR Range","authors":"S. Gruszczyński","doi":"10.7494/GEOM.2021.15.1.63","DOIUrl":null,"url":null,"abstract":"Using the Land Use and Coverage Frame Survey (LUCAS) database of European soil surface layer properties, statistical and machine learning predictive models for several key soil characteristics (clay content, pH in CaCl2, concentration of organic carbon, calcium carbonates and nitrogen and exchange cations capacity) were compared on the basis of processing their spectral responses in the visible (Vis) and near‑infrared (NIR) parts. Standard methods of relationship modeling were used: stepwise regression, partial least squares regression and linear regression with input data obtained from principal components analysis. Using the inputs extracted by statistical algorithms various machine learning algorithms were used in the modeling. The usefulness of the models was analyzed by comparison with the values of the determination coefficients, the root mean square error and the distribution of residual values. The mean square error of estimation in the cross‑validation procedure for the stack model using the multilayer perceptron and the distributed random forest were as follows: for clay content – ca. 4.5%; for pH – ca. 0.35; for SOC – ca. 7.5 g/kg (0.75% by weight); for CaCO3 content – ca. 19 g/kg; for N content – ca. 0.50 g/kg; and for CEC – ca. 3.5 cmol(+)/kg.","PeriodicalId":36672,"journal":{"name":"Geomatics and Environmental Engineering","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2021-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geomatics and Environmental Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.7494/GEOM.2021.15.1.63","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"Social Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Using the Land Use and Coverage Frame Survey (LUCAS) database of European soil surface layer properties, statistical and machine learning predictive models for several key soil characteristics (clay content, pH in CaCl2, concentration of organic carbon, calcium carbonates and nitrogen and exchange cations capacity) were compared on the basis of processing their spectral responses in the visible (Vis) and near‑infrared (NIR) parts. Standard methods of relationship modeling were used: stepwise regression, partial least squares regression and linear regression with input data obtained from principal components analysis. Using the inputs extracted by statistical algorithms various machine learning algorithms were used in the modeling. The usefulness of the models was analyzed by comparison with the values of the determination coefficients, the root mean square error and the distribution of residual values. The mean square error of estimation in the cross‑validation procedure for the stack model using the multilayer perceptron and the distributed random forest were as follows: for clay content – ca. 4.5%; for pH – ca. 0.35; for SOC – ca. 7.5 g/kg (0.75% by weight); for CaCO3 content – ca. 19 g/kg; for N content – ca. 0.50 g/kg; and for CEC – ca. 3.5 cmol(+)/kg.
基于可见-近红外光谱响应预测土壤特征的一些机器学习算法的评估
利用欧洲土壤表层特性的土地利用和覆盖框架调查(LUCAS)数据库,在处理其在可见光(Vis)和近红外(NIR)部分的光谱响应的基础上,比较了几种关键土壤特征(粘土含量、CaCl2中的pH、有机碳、碳酸钙和氮的浓度以及交换阳离子容量)的统计和机器学习预测模型。使用了标准的关系建模方法:逐步回归、偏最小二乘回归和线性回归,输入数据来自主成分分析。使用统计算法提取的输入,在建模中使用了各种机器学习算法。通过与确定系数、均方根误差和残差分布的值进行比较,分析了模型的有用性。使用多层感知器和分布式随机森林的堆栈模型的交叉验证过程中的估计均方误差如下:对于粘土含量,约4.5%;pH–约0.35;SOC——约7.5 g/kg(0.75重量%);CaCO3含量–ca.19g/kg;氮含量——约0.50 g/kg;CEC–ca。3.5 cmol(+)/kg。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Geomatics and Environmental Engineering
Geomatics and Environmental Engineering Earth and Planetary Sciences-Computers in Earth Sciences
CiteScore
2.30
自引率
0.00%
发文量
27
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信