An ensemble estimate of Australian soil organic carbon using machine learning and process-based modelling

IF 5.8 2区 农林科学 Q1 SOIL SCIENCE
Soil Pub Date : 2024-01-22 DOI:10.5194/egusphere-2023-3016
Lingfei Wang, Gab Abramowitz, Ying-Ping Wang, Andy Pitman, Raphael Viscarra Rossel
{"title":"An ensemble estimate of Australian soil organic carbon using machine learning and process-based modelling","authors":"Lingfei Wang, Gab Abramowitz, Ying-Ping Wang, Andy Pitman, Raphael Viscarra Rossel","doi":"10.5194/egusphere-2023-3016","DOIUrl":null,"url":null,"abstract":"<strong>Abstract.</strong> Spatially explicit prediction of soil organic carbon (SOC) serves as a crucial foundation for effective land management strategies aimed at mitigating soil degradation and assessing carbon sequestration potential. Here, using more than 1000 in-situ observations, we trained two machine learning models (random forest, and K-means coupled with multiple linear regression), and one process-based model (the vertically resolved MIcrobial-MIneral Carbon Stabilization (MIMICS)) to predict SOC content of the top 30 cm of soil in Australia. Parameters of MIMICS were optimized for different site groupings, using two distinct approaches, plant functional types (MIMICS-PFT), and the most influential environmental factors (MIMICS-ENV). We found that at the continental scale, soil bulk density and mean annual temperature are the dominant controls of SOC variation, and that dominant controls vary for different vegetation types. All models showed good performance in SOC predictions with R<sup>2</sup> greater than 0.8 during out-of-sample validation with random forest being the most accurate, and SOC in forests is more predictable than that in non-forest soils. Parameter optimization approaches made a notable difference in the performance of MIMICS SOC prediction with MIMICS-ENV performing better than MIMICS-PFT especially in non-forest soils. Digital maps of terrestrial SOC stocks generated using all the models showed similar spatial distribution with higher values in southeast and southwest Australia, but the magnitude of estimated SOC stocks varied. The mean ensemble estimate of SOC stocks was 30.08 t/ha with K-means coupled with multiple linear regression generating the highest estimate (mean SOC stocks at 38.15 t/ha) and MIMICS-PFT generating the lowest estimate (mean SOC stocks at 24.29 t/ha). We suggest that enhancing process-based models to incorporate newly identified drivers that significantly influence SOC variations in different environments could be key to reducing the discrepancies in these estimates. Our findings underscore the considerable uncertainty in SOC estimates derived from different modelling approaches and emphasize the importance of rigorous out-of-sample validation before applying any one approach in Australia.","PeriodicalId":48610,"journal":{"name":"Soil","volume":"11 8 1","pages":""},"PeriodicalIF":5.8000,"publicationDate":"2024-01-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Soil","FirstCategoryId":"97","ListUrlMain":"https://doi.org/10.5194/egusphere-2023-3016","RegionNum":2,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"SOIL SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract. Spatially explicit prediction of soil organic carbon (SOC) serves as a crucial foundation for effective land management strategies aimed at mitigating soil degradation and assessing carbon sequestration potential. Here, using more than 1000 in-situ observations, we trained two machine learning models (random forest, and K-means coupled with multiple linear regression), and one process-based model (the vertically resolved MIcrobial-MIneral Carbon Stabilization (MIMICS)) to predict SOC content of the top 30 cm of soil in Australia. Parameters of MIMICS were optimized for different site groupings, using two distinct approaches, plant functional types (MIMICS-PFT), and the most influential environmental factors (MIMICS-ENV). We found that at the continental scale, soil bulk density and mean annual temperature are the dominant controls of SOC variation, and that dominant controls vary for different vegetation types. All models showed good performance in SOC predictions with R2 greater than 0.8 during out-of-sample validation with random forest being the most accurate, and SOC in forests is more predictable than that in non-forest soils. Parameter optimization approaches made a notable difference in the performance of MIMICS SOC prediction with MIMICS-ENV performing better than MIMICS-PFT especially in non-forest soils. Digital maps of terrestrial SOC stocks generated using all the models showed similar spatial distribution with higher values in southeast and southwest Australia, but the magnitude of estimated SOC stocks varied. The mean ensemble estimate of SOC stocks was 30.08 t/ha with K-means coupled with multiple linear regression generating the highest estimate (mean SOC stocks at 38.15 t/ha) and MIMICS-PFT generating the lowest estimate (mean SOC stocks at 24.29 t/ha). We suggest that enhancing process-based models to incorporate newly identified drivers that significantly influence SOC variations in different environments could be key to reducing the discrepancies in these estimates. Our findings underscore the considerable uncertainty in SOC estimates derived from different modelling approaches and emphasize the importance of rigorous out-of-sample validation before applying any one approach in Australia.
利用机器学习和基于过程的建模对澳大利亚土壤有机碳进行集合估算
摘要土壤有机碳(SOC)的空间明确预测是有效土地管理战略的重要基础,旨在缓解土壤退化和评估固碳潜力。在此,我们利用 1000 多次原位观测,训练了两个机器学习模型(随机森林和 K-means 结合多元线性回归)和一个基于过程的模型(垂直解析的微生物-矿物碳稳定模型(MIMICS)),以预测澳大利亚土壤顶部 30 厘米的 SOC 含量。利用植物功能类型(MIMICS-PFT)和最有影响的环境因素(MIMICS-ENV)这两种不同的方法,针对不同的地点分组对 MIMICS 的参数进行了优化。我们发现,在大陆尺度上,土壤容重和年平均气温是 SOC 变化的主要控制因素,而不同植被类型的主要控制因素各不相同。在样本外验证过程中,所有模型都显示出良好的 SOC 预测性能,R2 大于 0.8,其中随机森林的预测精度最高,森林中的 SOC 比非森林土壤中的 SOC 更可预测。参数优化方法对 MIMICS SOC 预测性能的影响非常明显,MIMICS-ENV 的性能优于 MIMICS-PFT,尤其是在非森林土壤中。使用所有模型生成的陆地 SOC 储量数字地图显示出相似的空间分布,澳大利亚东南部和西南部的数值较高,但估计的 SOC 储量的大小各不相同。SOC 储量的平均集合估算值为 30.08 吨/公顷,其中 K-means 结合多元线性回归得出的估算值最高(SOC 储量平均值为 38.15 吨/公顷),而 MIMICS-PFT 得出的估算值最低(SOC 储量平均值为 24.29 吨/公顷)。我们认为,加强基于过程的模型,纳入新发现的对不同环境中 SOC 变化有显著影响的驱动因素,可能是减少这些估计值差异的关键。我们的研究结果表明,不同建模方法得出的 SOC 估算值具有相当大的不确定性,并强调了在澳大利亚应用任何一种方法之前进行严格的样本外验证的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Soil
Soil Agricultural and Biological Sciences-Soil Science
CiteScore
10.80
自引率
2.90%
发文量
44
审稿时长
30 weeks
期刊介绍: SOIL is an international scientific journal dedicated to the publication and discussion of high-quality research in the field of soil system sciences. SOIL is at the interface between the atmosphere, lithosphere, hydrosphere, and biosphere. SOIL publishes scientific research that contributes to understanding the soil system and its interaction with humans and the entire Earth system. The scope of the journal includes all topics that fall within the study of soil science as a discipline, with an emphasis on studies that integrate soil science with other sciences (hydrology, agronomy, socio-economics, health sciences, atmospheric sciences, etc.).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信