Evaluation of soil quality of cultivated lands with classification and regression-based machine learning algorithms optimization under humid environmental condition

IF 2.8 3区 地球科学 Q2 ASTRONOMY & ASTROPHYSICS
Orhan Dengiz , Pelin Alaboz , Fikret Saygın , Kemal Adem , Emre Yüksek
{"title":"Evaluation of soil quality of cultivated lands with classification and regression-based machine learning algorithms optimization under humid environmental condition","authors":"Orhan Dengiz ,&nbsp;Pelin Alaboz ,&nbsp;Fikret Saygın ,&nbsp;Kemal Adem ,&nbsp;Emre Yüksek","doi":"10.1016/j.asr.2024.08.048","DOIUrl":null,"url":null,"abstract":"<div><div>In soil science, machine learning algorithms are preferred for pedotransfer functions due to their rapid data acquisition and high prediction accuracy. The current study aims to evaluate the prediction of soil quality in agricultural lands dominated by the humid Black Sea climate using various algorithms. Both classification and regression-based algorithms (Random Forest-RF, Light Gradient Boosting-LGB, Extreme Gradient Boosting-XGBoost, k-nearest neighbors-kNN, Logistic Regression, multilayer perceptron-MLP, Linear Regression-LR and Bayesian Ridge- BR) were used in the method. The comparison of soil maps is also included. Furthermore, the present study evaluates the Grid Search optimization method with K-Fold Cross Validation (K = 5) for both classification and regression-based algorithms. The prediction of soil quality was performed using class-based and regression-based algorithms. As a result of the study, the RF and XGBoost algorithms achieved an approximate accuracy rate of 92 % in the class-based prediction. In regression-based predictions, the most successful algorithms were BR and LR, with an R<sup>2</sup> Score of 0.84. The Grid Search optimization method was used to improve the R<sup>2</sup> Score, resulting in an increase to 0.90 and 0.88 for BR and LR, respectively. The optimized hyperparameters showed improved performance in predicting the soil quality index. The present study found that Gaussian and Spherical models had the lowest prediction errors in spatial distribution maps. Tree-based algorithms were found to be suitable for class-based prediction of soil quality, while the linear regression method was appropriate for regression predictions. This study is characterized by a rainy climate resulting in acidic soils with high organic matter content. Planning of new studies in different climates and soil properties is recommended.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"74 11","pages":"Pages 5514-5529"},"PeriodicalIF":2.8000,"publicationDate":"2024-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S027311772400872X","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0

Abstract

In soil science, machine learning algorithms are preferred for pedotransfer functions due to their rapid data acquisition and high prediction accuracy. The current study aims to evaluate the prediction of soil quality in agricultural lands dominated by the humid Black Sea climate using various algorithms. Both classification and regression-based algorithms (Random Forest-RF, Light Gradient Boosting-LGB, Extreme Gradient Boosting-XGBoost, k-nearest neighbors-kNN, Logistic Regression, multilayer perceptron-MLP, Linear Regression-LR and Bayesian Ridge- BR) were used in the method. The comparison of soil maps is also included. Furthermore, the present study evaluates the Grid Search optimization method with K-Fold Cross Validation (K = 5) for both classification and regression-based algorithms. The prediction of soil quality was performed using class-based and regression-based algorithms. As a result of the study, the RF and XGBoost algorithms achieved an approximate accuracy rate of 92 % in the class-based prediction. In regression-based predictions, the most successful algorithms were BR and LR, with an R2 Score of 0.84. The Grid Search optimization method was used to improve the R2 Score, resulting in an increase to 0.90 and 0.88 for BR and LR, respectively. The optimized hyperparameters showed improved performance in predicting the soil quality index. The present study found that Gaussian and Spherical models had the lowest prediction errors in spatial distribution maps. Tree-based algorithms were found to be suitable for class-based prediction of soil quality, while the linear regression method was appropriate for regression predictions. This study is characterized by a rainy climate resulting in acidic soils with high organic matter content. Planning of new studies in different climates and soil properties is recommended.
用基于分类和回归的机器学习算法优化评估潮湿环境条件下耕地的土壤质量
在土壤科学领域,机器学习算法因其数据获取速度快、预测准确度高等优点,成为 pedotransfer 功能的首选。目前的研究旨在利用各种算法评估以黑海湿润气候为主的农业用地的土壤质量预测。该方法使用了基于分类和回归的算法(随机森林-RF、轻梯度提升-LGB、极梯度提升-XGBoost、k-近邻-kNN、逻辑回归、多层感知器-MLP、线性回归-LR 和贝叶斯山脊-BR)。本研究还对土壤地图进行了比较。此外,本研究还对基于分类和回归算法的网格搜索优化方法进行了 K-Fold 交叉验证(K = 5)评估。使用基于分类和基于回归的算法对土壤质量进行了预测。研究结果表明,RF 算法和 XGBoost 算法的分类预测准确率约为 92%。在基于回归的预测中,最成功的算法是 BR 和 LR,R 得分为 0.84。采用网格搜索优化方法提高了 R Score,使 BR 和 LR 的 R Score 分别提高到 0.90 和 0.88。优化后的超参数在预测土壤质量指数方面的性能有所提高。本研究发现,高斯模型和球形模型在空间分布图中的预测误差最小。研究发现,基于树的算法适用于基于类别的土壤质量预测,而线性回归方法适用于回归预测。这项研究的特点是多雨气候,导致土壤呈酸性,有机质含量高。建议在不同气候和土壤特性条件下规划新的研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advances in Space Research
Advances in Space Research 地学天文-地球科学综合
CiteScore
5.20
自引率
11.50%
发文量
800
审稿时长
5.8 months
期刊介绍: The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc. NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR). All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信