Novel soil health assessment framework for legume-based rotation farmland by interpretable machine learning with causal inference

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Xuebin Xu , Qiong Liu , Yalin Liu , Yongfu Li , Yixuan Chen , Tong Lei , Yakov Kuzyakov , Wenju Zhang , Jianping Chen , Tida Ge
{"title":"Novel soil health assessment framework for legume-based rotation farmland by interpretable machine learning with causal inference","authors":"Xuebin Xu ,&nbsp;Qiong Liu ,&nbsp;Yalin Liu ,&nbsp;Yongfu Li ,&nbsp;Yixuan Chen ,&nbsp;Tong Lei ,&nbsp;Yakov Kuzyakov ,&nbsp;Wenju Zhang ,&nbsp;Jianping Chen ,&nbsp;Tida Ge","doi":"10.1016/j.compag.2025.111011","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and robust soil health assessment is essential for sustaining legume-based rotation systems and informing their optimized management. To address the limitations of conventional methods in capturing management-induced variations, we developed an innovative framework grounded in the theoretical hypothesis that soil health reflects soil’s capacity to maximize production stability while minimizing input requirements. This framework synergistically integrates interpretable machine learning with causal inference and network analysis (CI-SHAP-NA), implementing a systematic workflow encompassing indicator selection, quantitative scoring, and multidimensional integration. Our framework was systematically implemented to assess soil health across diverse legume-based rotation systems in China. The results showed that CI-SHAP-NA identified a parsimonious yet highly informative set of indicators (soil organic carbon, available iron, and cellobiohydrolase) demonstrating superior explanatory power for critical soil ecological processes. The derived soil health index (SHI) by the CI-SHAP-NA framework demonstrated enhanced discriminative capacity (SHI range: 0.01−0.92) and strong concordance (R<sup>2</sup> = 0.80) with conventional total dataset assessment while maintaining significant predictive validity for crop productivity (Pearson’s <em>r</em> = 0.21, <em>p</em> &lt; 0.001). It consistently outperformed PCA and NA methods in both explanatory power and fairness comparisons. The selected indicators proved robust and non-redundant, as substituting any indicator significantly reduced the correlation and sensitivity of SHI. Furthermore, CI-SHAP-NA demonstrated strong transferability, showing a stronger correlation with yield (<em>r</em> = 0.25, <em>p</em> &lt; 0.001) on internally established independent sites than PCA and NA. This framework successfully resolved previously obscured soil health gradients between contrasting management systems, with paddy-legume rotations consistently outperforming their dryland counterparts − a differentiation rigorously validated against traditional benchmarks. These findings collectively establish the CI-SHAP-NA framework as a transformative tool for soil health assessment, offering substantial advantages over conventional approaches in terms of analytical robustness, ecological relevance, and practical utility. Future research should aim to incorporate multi-functional indicators as well as evaluate the framework’s performance across varied agroecosystems.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 111011"},"PeriodicalIF":8.9000,"publicationDate":"2025-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925011172","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate and robust soil health assessment is essential for sustaining legume-based rotation systems and informing their optimized management. To address the limitations of conventional methods in capturing management-induced variations, we developed an innovative framework grounded in the theoretical hypothesis that soil health reflects soil’s capacity to maximize production stability while minimizing input requirements. This framework synergistically integrates interpretable machine learning with causal inference and network analysis (CI-SHAP-NA), implementing a systematic workflow encompassing indicator selection, quantitative scoring, and multidimensional integration. Our framework was systematically implemented to assess soil health across diverse legume-based rotation systems in China. The results showed that CI-SHAP-NA identified a parsimonious yet highly informative set of indicators (soil organic carbon, available iron, and cellobiohydrolase) demonstrating superior explanatory power for critical soil ecological processes. The derived soil health index (SHI) by the CI-SHAP-NA framework demonstrated enhanced discriminative capacity (SHI range: 0.01−0.92) and strong concordance (R2 = 0.80) with conventional total dataset assessment while maintaining significant predictive validity for crop productivity (Pearson’s r = 0.21, p < 0.001). It consistently outperformed PCA and NA methods in both explanatory power and fairness comparisons. The selected indicators proved robust and non-redundant, as substituting any indicator significantly reduced the correlation and sensitivity of SHI. Furthermore, CI-SHAP-NA demonstrated strong transferability, showing a stronger correlation with yield (r = 0.25, p < 0.001) on internally established independent sites than PCA and NA. This framework successfully resolved previously obscured soil health gradients between contrasting management systems, with paddy-legume rotations consistently outperforming their dryland counterparts − a differentiation rigorously validated against traditional benchmarks. These findings collectively establish the CI-SHAP-NA framework as a transformative tool for soil health assessment, offering substantial advantages over conventional approaches in terms of analytical robustness, ecological relevance, and practical utility. Future research should aim to incorporate multi-functional indicators as well as evaluate the framework’s performance across varied agroecosystems.
基于因果推理的可解释机器学习豆科轮作农田土壤健康评价框架
准确而有力的土壤健康评估对于维持以豆科植物为基础的轮作系统和为其优化管理提供信息至关重要。为了解决传统方法在捕获管理引起的变化方面的局限性,我们基于土壤健康反映土壤在最大限度地减少投入需求的同时最大化生产稳定性的能力这一理论假设,开发了一个创新框架。该框架协同集成了可解释的机器学习与因果推理和网络分析(CI-SHAP-NA),实现了一个系统的工作流程,包括指标选择、定量评分和多维集成。系统地实施了我们的框架,以评估中国不同豆科植物轮作系统的土壤健康状况。结果表明,CI-SHAP-NA鉴定出一套简洁但信息量很大的指标(土壤有机碳、有效铁和纤维素生物水解酶),对关键的土壤生态过程具有很强的解释力。通过CI-SHAP-NA框架导出的土壤健康指数(SHI)与传统的总数据集评估显示出增强的判别能力(SHI范围:0.01 - 0.92)和强一致性(R2 = 0.80),同时对作物生产力保持显著的预测效度(Pearson 's r = 0.21, p < 0.001)。在解释力和公平性比较中,它始终优于PCA和NA方法。所选择的指标证明了鲁棒性和非冗余性,因为替换任何指标都会显著降低SHI的相关性和敏感性。此外,CI-SHAP-NA表现出很强的可转移性,与PCA和NA相比,在内部建立的独立位点上与产量的相关性更强(r = 0.25, p < 0.001)。该框架成功地解决了以前在不同管理系统之间模糊的土壤健康梯度,稻田-豆科作物轮作的表现始终优于旱地轮作,这一差异经过传统基准的严格验证。这些发现共同确立了CI-SHAP-NA框架作为土壤健康评估的变革性工具,在分析稳健性、生态相关性和实用性方面比传统方法具有实质性优势。未来的研究应旨在纳入多功能指标,并评估该框架在不同农业生态系统中的表现。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信