An ensemble estimate of Australian soil organic carbon using machine learning and process-based modelling

IF 5.8 2区 农林科学 Q1 SOIL SCIENCE
Soil Pub Date : 2024-09-10 DOI:10.5194/soil-10-619-2024
Lingfei Wang, Gab Abramowitz, Ying-Ping Wang, Andy Pitman, Raphael A. Viscarra Rossel
{"title":"An ensemble estimate of Australian soil organic carbon using machine learning and process-based modelling","authors":"Lingfei Wang, Gab Abramowitz, Ying-Ping Wang, Andy Pitman, Raphael A. Viscarra Rossel","doi":"10.5194/soil-10-619-2024","DOIUrl":null,"url":null,"abstract":"Abstract. Spatially explicit prediction of soil organic carbon (SOC) serves as a crucial foundation for effective land management strategies aimed at mitigating soil degradation and assessing carbon sequestration potential. Here, using more than 1000 in situ observations, we trained two machine learning models (a random forest model and a k-means coupled with multiple linear regression model) and one process-based model (the vertically resolved MIcrobial-MIneral Carbon Stabilization, MIMICS, model) to predict the SOC stocks of the top 30 cm of soil in Australia. Parameters of MIMICS were optimised for different site groupings using two distinct approaches: plant functional types (MIMICS-PFT) and the most influential environmental factors (MIMICS-ENV). All models showed good performance with respect to SOC predictions, with an R2 value greater than 0.8 during out-of-sample validation, with random forest being the most accurate; moreover, it was found that SOC in forests is more predictable than that in non-forest soils excluding croplands. The performance of continental-scale SOC predictions by MIMICS-ENV is better than that by MIMICS-PFT especially in non-forest soils. Digital maps of terrestrial SOC stocks generated using all of the models showed a similar spatial distribution, with higher values in south-eastern and south-western Australia, but the magnitude of the estimated SOC stocks varied. The mean ensemble estimate of SOC stocks was 30.3 t ha−1, with k-means coupled with multiple linear regression generating the highest estimate (mean SOC stocks of 38.15 t ha−1) and MIMICS-PFT generating the lowest estimate (mean SOC stocks of 24.29 t ha−1). We suggest that enhancing process-based models to incorporate newly identified drivers that significantly influence SOC variation in different environments could be the key to reducing the discrepancies in these estimates. Our findings underscore the considerable uncertainty in SOC estimates derived from different modelling approaches and emphasise the importance of rigorous out-of-sample validation before applying any one approach in Australia.","PeriodicalId":48610,"journal":{"name":"Soil","volume":"2015 1","pages":""},"PeriodicalIF":5.8000,"publicationDate":"2024-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.5194/soil-10-619-2024","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract. Spatially explicit prediction of soil organic carbon (SOC) serves as a crucial foundation for effective land management strategies aimed at mitigating soil degradation and assessing carbon sequestration potential. Here, using more than 1000 in situ observations, we trained two machine learning models (a random forest model and a k-means coupled with multiple linear regression model) and one process-based model (the vertically resolved MIcrobial-MIneral Carbon Stabilization, MIMICS, model) to predict the SOC stocks of the top 30 cm of soil in Australia. Parameters of MIMICS were optimised for different site groupings using two distinct approaches: plant functional types (MIMICS-PFT) and the most influential environmental factors (MIMICS-ENV). All models showed good performance with respect to SOC predictions, with an R2 value greater than 0.8 during out-of-sample validation, with random forest being the most accurate; moreover, it was found that SOC in forests is more predictable than that in non-forest soils excluding croplands. The performance of continental-scale SOC predictions by MIMICS-ENV is better than that by MIMICS-PFT especially in non-forest soils. Digital maps of terrestrial SOC stocks generated using all of the models showed a similar spatial distribution, with higher values in south-eastern and south-western Australia, but the magnitude of the estimated SOC stocks varied. The mean ensemble estimate of SOC stocks was 30.3 t ha−1, with k-means coupled with multiple linear regression generating the highest estimate (mean SOC stocks of 38.15 t ha−1) and MIMICS-PFT generating the lowest estimate (mean SOC stocks of 24.29 t ha−1). We suggest that enhancing process-based models to incorporate newly identified drivers that significantly influence SOC variation in different environments could be the key to reducing the discrepancies in these estimates. Our findings underscore the considerable uncertainty in SOC estimates derived from different modelling approaches and emphasise the importance of rigorous out-of-sample validation before applying any one approach in Australia.
利用机器学习和基于过程的建模对澳大利亚土壤有机碳进行集合估算
摘要土壤有机碳(SOC)的空间明确预测是有效土地管理战略的重要基础,旨在缓解土壤退化和评估固碳潜力。在此,我们利用 1000 多个原位观测数据,训练了两个机器学习模型(随机森林模型和 k-means 耦合多元线性回归模型)和一个基于过程的模型(垂直解析的微生物-矿物碳稳定模型 MIMICS),以预测澳大利亚土壤顶部 30 厘米的 SOC 储量。利用两种不同的方法:植物功能类型(MIMICS-PFT)和最有影响的环境因素(MIMICS-ENV),对不同地点分组的 MIMICS 参数进行了优化。所有模型在 SOC 预测方面都表现良好,在样本外验证中 R2 值大于 0.8,其中随机森林最为准确;此外,研究还发现森林中的 SOC 比非森林土壤(不包括耕地)中的 SOC 更可预测。MIMICS-ENV 的大陆尺度 SOC 预测性能优于 MIMICS-PFT,尤其是在非森林土壤中。利用所有模型生成的陆地 SOC 储量数字地图显示出相似的空间分布,澳大利亚东南部和西南部的数值较高,但估计的 SOC 储量大小不一。SOC 储量的平均集合估计值为 30.3 吨/公顷,其中 k-means 结合多元线性回归得出的估计值最高(SOC 储量平均值为 38.15 吨/公顷),而 MIMICS-PFT 得出的估计值最低(SOC 储量平均值为 24.29 吨/公顷)。我们认为,加强基于过程的模型,纳入新发现的对不同环境中 SOC 变化有显著影响的驱动因素,可能是减少这些估计值差异的关键。我们的研究结果强调了不同建模方法得出的 SOC 估算值存在很大的不确定性,并强调了在澳大利亚应用任何一种方法之前进行严格的样本外验证的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Soil
Soil Agricultural and Biological Sciences-Soil Science
CiteScore
10.80
自引率
2.90%
发文量
44
审稿时长
30 weeks
期刊介绍: SOIL is an international scientific journal dedicated to the publication and discussion of high-quality research in the field of soil system sciences. SOIL is at the interface between the atmosphere, lithosphere, hydrosphere, and biosphere. SOIL publishes scientific research that contributes to understanding the soil system and its interaction with humans and the entire Earth system. The scope of the journal includes all topics that fall within the study of soil science as a discipline, with an emphasis on studies that integrate soil science with other sciences (hydrology, agronomy, socio-economics, health sciences, atmospheric sciences, etc.).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信