Estimating soil organic carbon using sentinel-2 data under zero tillage agriculture: a machine learning approach

IF 2.7 4区 地球科学 Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Lawrence Mango, Nuthammachot Narissara, Som-ard Jaturong
{"title":"Estimating soil organic carbon using sentinel-2 data under zero tillage agriculture: a machine learning approach","authors":"Lawrence Mango, Nuthammachot Narissara, Som-ard Jaturong","doi":"10.1007/s12145-024-01427-y","DOIUrl":null,"url":null,"abstract":"<p>Soil organic carbon (SOC) is the main component of soil organic matter (SOM) and constitutes the crucial component of the soil. It supports key soil functions, stabilizes soil structure, aid in plant-nutrient retention and release, and promote water infiltration and storage. Predicting SOC using Sentinel-2 data integrated with machine learning algorithms under zero tillage practice is inadequately documented for developing countries like Zimbabwe. The purpose of this study is to evaluate the performance of support vector machine (SVM), artificial neural network (ANN), and partial least square regression (PLSR) algorithms from Sentinel-2 data for SOC estimation. The SVM, ANN and PLSR models were used with a cross-validation to estimate the SOC content based on 50 georeferenced calibration samples under a zero-tillage practice. The ANN model outperformed the other two models by delivering a coefficient of determination (R<sup>2</sup>) of between 55 and 60% of SOC variability and RMSE varied between 5.01 and 8.78%, whereas for the SVM, R<sup>2</sup> varied between 0.53 and 0.57 and RMSE varied between 6.25 and 11.39%. The least estimates of SOC provided by the PLSR algorithm were, R<sup>2</sup> = 0.44–0.49 and RMSE = 7.59–12.42% for the top 15 cm depth. Results with and R<sup>2</sup>, root mean square error (RMSE) and mean absolute error (MAE) for SVM, ANN and PLSR, show that the ANN model is highly capable for capturing SOC variability. Although the ANN algorithm provides more accurate SOC estimates than the SVM algorithm, the difference in accuracy is not significant. Results revealed a satisfactory agreement between the SOC content and zero tillage practice (R<sup>2</sup>, coefficient of variation (CV), MAE, and RMSE using SVM, ANN and PLSR for the validation dataset using four predictor variables. The calibration results of SOC indicated that the mean SOC was 15.83% and the validation mean SOC was 17.02%. The SOC validation dataset (34.17%) had higher degree of variation around its mean as compared to the calibration dataset (29.86%). The SOC prediction results can be used as an important tool for informed decisions about soil health and productivity by the farmers, land managers and policy makers.</p>","PeriodicalId":49318,"journal":{"name":"Earth Science Informatics","volume":"117 1","pages":""},"PeriodicalIF":2.7000,"publicationDate":"2024-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Earth Science Informatics","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1007/s12145-024-01427-y","RegionNum":4,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Soil organic carbon (SOC) is the main component of soil organic matter (SOM) and constitutes the crucial component of the soil. It supports key soil functions, stabilizes soil structure, aid in plant-nutrient retention and release, and promote water infiltration and storage. Predicting SOC using Sentinel-2 data integrated with machine learning algorithms under zero tillage practice is inadequately documented for developing countries like Zimbabwe. The purpose of this study is to evaluate the performance of support vector machine (SVM), artificial neural network (ANN), and partial least square regression (PLSR) algorithms from Sentinel-2 data for SOC estimation. The SVM, ANN and PLSR models were used with a cross-validation to estimate the SOC content based on 50 georeferenced calibration samples under a zero-tillage practice. The ANN model outperformed the other two models by delivering a coefficient of determination (R2) of between 55 and 60% of SOC variability and RMSE varied between 5.01 and 8.78%, whereas for the SVM, R2 varied between 0.53 and 0.57 and RMSE varied between 6.25 and 11.39%. The least estimates of SOC provided by the PLSR algorithm were, R2 = 0.44–0.49 and RMSE = 7.59–12.42% for the top 15 cm depth. Results with and R2, root mean square error (RMSE) and mean absolute error (MAE) for SVM, ANN and PLSR, show that the ANN model is highly capable for capturing SOC variability. Although the ANN algorithm provides more accurate SOC estimates than the SVM algorithm, the difference in accuracy is not significant. Results revealed a satisfactory agreement between the SOC content and zero tillage practice (R2, coefficient of variation (CV), MAE, and RMSE using SVM, ANN and PLSR for the validation dataset using four predictor variables. The calibration results of SOC indicated that the mean SOC was 15.83% and the validation mean SOC was 17.02%. The SOC validation dataset (34.17%) had higher degree of variation around its mean as compared to the calibration dataset (29.86%). The SOC prediction results can be used as an important tool for informed decisions about soil health and productivity by the farmers, land managers and policy makers.

Abstract Image

利用哨兵-2 数据估算零耕作农业下的土壤有机碳:一种机器学习方法
土壤有机碳(SOC)是土壤有机质(SOM)的主要成分,也是土壤的重要组成部分。它支持土壤的主要功能,稳定土壤结构,有助于植物养分的保持和释放,并促进水分的渗透和储存。对于像津巴布韦这样的发展中国家来说,利用哨兵-2 数据与机器学习算法相结合,预测零耕作实践下的 SOC 的文献资料尚不充分。本研究的目的是评估支持向量机 (SVM)、人工神经网络 (ANN) 和偏最小二乘回归 (PLSR) 算法在利用哨兵-2 数据估算 SOC 方面的性能。利用 SVM、ANN 和 PLSR 模型进行交叉验证,根据 50 个地理参照校准样本估算出零耕法下的 SOC 含量。ANN 模型的 SOC 变异性决定系数 (R2) 在 55% 到 60% 之间,RMSE 在 5.01% 到 8.78% 之间;而 SVM 模型的 R2 在 0.53% 到 0.57% 之间,RMSE 在 6.25% 到 11.39% 之间。PLSR 算法对顶部 15 厘米深度 SOC 的最小估计值为 R2 = 0.44-0.49 和 RMSE = 7.59-12.42%。SVM、ANN 和 PLSR 的 R2、均方根误差(RMSE)和平均绝对误差(MAE)结果表明,ANN 模型能够很好地捕捉 SOC 的变化。虽然与 SVM 算法相比,ANN 算法能提供更准确的 SOC 估计值,但准确度的差异并不显著。结果显示,在使用四个预测变量的验证数据集上,使用 SVM、ANN 和 PLSR 得出的 SOC 含量与零耕作实践之间的 R2、变异系数 (CV)、MAE 和 RMSE 的一致性令人满意。SOC 的校准结果表明,平均 SOC 为 15.83%,验证平均 SOC 为 17.02%。与校准数据集(29.86%)相比,SOC 验证数据集(34.17%)围绕其平均值的变化程度更高。SOC 预测结果可作为农民、土地管理者和政策制定者就土壤健康和生产力做出明智决策的重要工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Earth Science Informatics
Earth Science Informatics COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS-GEOSCIENCES, MULTIDISCIPLINARY
CiteScore
4.60
自引率
3.60%
发文量
157
审稿时长
4.3 months
期刊介绍: The Earth Science Informatics [ESIN] journal aims at rapid publication of high-quality, current, cutting-edge, and provocative scientific work in the area of Earth Science Informatics as it relates to Earth systems science and space science. This includes articles on the application of formal and computational methods, computational Earth science, spatial and temporal analyses, and all aspects of computer applications to the acquisition, storage, processing, interchange, and visualization of data and information about the materials, properties, processes, features, and phenomena that occur at all scales and locations in the Earth system’s five components (atmosphere, hydrosphere, geosphere, biosphere, cryosphere) and in space (see "About this journal" for more detail). The quarterly journal publishes research, methodology, and software articles, as well as editorials, comments, and book and software reviews. Review articles of relevant findings, topics, and methodologies are also considered.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信