利用机器学习和博弈论预测和解释热带泥炭栽培中的总氮及其关键驱动因素

Heru Bagus Pulunggono, Yusuf Azmi Madani Madani, Lina Lathifah Nurazizah, Moh Zulfajrin
{"title":"利用机器学习和博弈论预测和解释热带泥炭栽培中的总氮及其关键驱动因素","authors":"Heru Bagus Pulunggono, Yusuf Azmi Madani Madani, Lina Lathifah Nurazizah, Moh Zulfajrin","doi":"10.52045/jca.v4i1.592","DOIUrl":null,"url":null,"abstract":"Currently, there is a growing interest among research communities in the development of statistical learning-based pedotransfer functions/PtFs to predict mineral soil nutrients; however, similar studies in peatlands are relatively rare. Moreover, extracting meaningful information from these ‘black-box’ models is crucial, particularly concerning their algorithmic complexity and the non-linear nature of the soil covariate interrelationships. This study employed the Pulunggono (2022a) dataset and the bootstrapping method, to (1) develop and evaluate seven PtF models, including both general linear models (GLM) and machine learning (ML) regressors for estimating total nitrogen (N) in tropical peat that has been drained and cultivated for oil palm (OP) in Riau, Indonesia and (2) explaining model functioning by incorporating Shapley Additive Explanation (SHAP), a tool derived from coalitional game theory. This study demonstrated the superior predictive performance of ML-based PtFs in estimating total N compared to GLM algorithms. The top-performing algorithms for PtF models were identified as GBM, XGB, and Cubist. The SHAP method revealed that sampling depth and organic C were consistently identified as the most important covariates across all models, irrespective of their algorithmic capabilities. Additionally, ML algorithms identified the total Fe, pH, and bulk density (BD) as significant covariates. Local explanations based on Shapley values indicated that the behavior of PtF-based algorithms diverged from their global explanations. This study emphasized the critical role of ML algorithms and game theory in accurately predicting total N in peatlands subjected to drainage and cultivation for OP and explaining their model behavior in relation to soil biogeochemical processes.","PeriodicalId":9663,"journal":{"name":"CELEBES Agricultural","volume":"114 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Prediction and Interpretation of Total N and Its Key Drivers in Cultivated Tropical Peat using Machine Learning and Game Theory\",\"authors\":\"Heru Bagus Pulunggono, Yusuf Azmi Madani Madani, Lina Lathifah Nurazizah, Moh Zulfajrin\",\"doi\":\"10.52045/jca.v4i1.592\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Currently, there is a growing interest among research communities in the development of statistical learning-based pedotransfer functions/PtFs to predict mineral soil nutrients; however, similar studies in peatlands are relatively rare. Moreover, extracting meaningful information from these ‘black-box’ models is crucial, particularly concerning their algorithmic complexity and the non-linear nature of the soil covariate interrelationships. This study employed the Pulunggono (2022a) dataset and the bootstrapping method, to (1) develop and evaluate seven PtF models, including both general linear models (GLM) and machine learning (ML) regressors for estimating total nitrogen (N) in tropical peat that has been drained and cultivated for oil palm (OP) in Riau, Indonesia and (2) explaining model functioning by incorporating Shapley Additive Explanation (SHAP), a tool derived from coalitional game theory. This study demonstrated the superior predictive performance of ML-based PtFs in estimating total N compared to GLM algorithms. The top-performing algorithms for PtF models were identified as GBM, XGB, and Cubist. The SHAP method revealed that sampling depth and organic C were consistently identified as the most important covariates across all models, irrespective of their algorithmic capabilities. Additionally, ML algorithms identified the total Fe, pH, and bulk density (BD) as significant covariates. Local explanations based on Shapley values indicated that the behavior of PtF-based algorithms diverged from their global explanations. This study emphasized the critical role of ML algorithms and game theory in accurately predicting total N in peatlands subjected to drainage and cultivation for OP and explaining their model behavior in relation to soil biogeochemical processes.\",\"PeriodicalId\":9663,\"journal\":{\"name\":\"CELEBES Agricultural\",\"volume\":\"114 5\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CELEBES Agricultural\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.52045/jca.v4i1.592\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CELEBES Agricultural","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.52045/jca.v4i1.592","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

目前,研究界对开发基于统计学习的 pedotransfer 函数/PtFs 来预测矿质土壤养分的兴趣日益浓厚;然而,泥炭地的类似研究却相对较少。此外,从这些 "黑箱 "模型中提取有意义的信息至关重要,尤其是在其算法复杂性和土壤协变量相互关系的非线性性质方面。本研究采用了 Pulunggono(2022a)数据集和引导法(bootstrapping method),(1) 开发和评估了七个 PtF 模型,包括一般线性模型(GLM)和机器学习(ML)回归因子,用于估算印度尼西亚廖内省已排水并用于油棕(OP)种植的热带泥炭中的总氮(N);(2) 通过结合 Shapley Additive Explanation(SHAP)(一种源自联盟博弈论的工具)来解释模型的功能。这项研究表明,与 GLM 算法相比,基于 ML 的 PtF 在估计总氮方面具有更优越的预测性能。PtF 模型中表现最好的算法是 GBM、XGB 和 Cubist。SHAP 方法显示,无论算法能力如何,取样深度和有机碳始终是所有模型中最重要的协变量。此外,ML 算法还将总铁、pH 值和容积密度(BD)确定为重要的协变量。基于 Shapley 值的局部解释表明,基于 PtF 算法的行为与其全局解释存在差异。这项研究强调了 ML 算法和博弈论在准确预测泥炭地排水和耕作 OP 总氮以及解释其与土壤生物地球化学过程相关的模型行为中的关键作用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Prediction and Interpretation of Total N and Its Key Drivers in Cultivated Tropical Peat using Machine Learning and Game Theory
Currently, there is a growing interest among research communities in the development of statistical learning-based pedotransfer functions/PtFs to predict mineral soil nutrients; however, similar studies in peatlands are relatively rare. Moreover, extracting meaningful information from these ‘black-box’ models is crucial, particularly concerning their algorithmic complexity and the non-linear nature of the soil covariate interrelationships. This study employed the Pulunggono (2022a) dataset and the bootstrapping method, to (1) develop and evaluate seven PtF models, including both general linear models (GLM) and machine learning (ML) regressors for estimating total nitrogen (N) in tropical peat that has been drained and cultivated for oil palm (OP) in Riau, Indonesia and (2) explaining model functioning by incorporating Shapley Additive Explanation (SHAP), a tool derived from coalitional game theory. This study demonstrated the superior predictive performance of ML-based PtFs in estimating total N compared to GLM algorithms. The top-performing algorithms for PtF models were identified as GBM, XGB, and Cubist. The SHAP method revealed that sampling depth and organic C were consistently identified as the most important covariates across all models, irrespective of their algorithmic capabilities. Additionally, ML algorithms identified the total Fe, pH, and bulk density (BD) as significant covariates. Local explanations based on Shapley values indicated that the behavior of PtF-based algorithms diverged from their global explanations. This study emphasized the critical role of ML algorithms and game theory in accurately predicting total N in peatlands subjected to drainage and cultivation for OP and explaining their model behavior in relation to soil biogeochemical processes.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信