Xuebin Xu , Qiong Liu , Yalin Liu , Yongfu Li , Yixuan Chen , Tong Lei , Yakov Kuzyakov , Wenju Zhang , Jianping Chen , Tida Ge
{"title":"Novel soil health assessment framework for legume-based rotation farmland by interpretable machine learning with causal inference","authors":"Xuebin Xu , Qiong Liu , Yalin Liu , Yongfu Li , Yixuan Chen , Tong Lei , Yakov Kuzyakov , Wenju Zhang , Jianping Chen , Tida Ge","doi":"10.1016/j.compag.2025.111011","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate and robust soil health assessment is essential for sustaining legume-based rotation systems and informing their optimized management. To address the limitations of conventional methods in capturing management-induced variations, we developed an innovative framework grounded in the theoretical hypothesis that soil health reflects soil’s capacity to maximize production stability while minimizing input requirements. This framework synergistically integrates interpretable machine learning with causal inference and network analysis (CI-SHAP-NA), implementing a systematic workflow encompassing indicator selection, quantitative scoring, and multidimensional integration. Our framework was systematically implemented to assess soil health across diverse legume-based rotation systems in China. The results showed that CI-SHAP-NA identified a parsimonious yet highly informative set of indicators (soil organic carbon, available iron, and cellobiohydrolase) demonstrating superior explanatory power for critical soil ecological processes. The derived soil health index (SHI) by the CI-SHAP-NA framework demonstrated enhanced discriminative capacity (SHI range: 0.01−0.92) and strong concordance (R<sup>2</sup> = 0.80) with conventional total dataset assessment while maintaining significant predictive validity for crop productivity (Pearson’s <em>r</em> = 0.21, <em>p</em> < 0.001). It consistently outperformed PCA and NA methods in both explanatory power and fairness comparisons. The selected indicators proved robust and non-redundant, as substituting any indicator significantly reduced the correlation and sensitivity of SHI. Furthermore, CI-SHAP-NA demonstrated strong transferability, showing a stronger correlation with yield (<em>r</em> = 0.25, <em>p</em> < 0.001) on internally established independent sites than PCA and NA. This framework successfully resolved previously obscured soil health gradients between contrasting management systems, with paddy-legume rotations consistently outperforming their dryland counterparts − a differentiation rigorously validated against traditional benchmarks. These findings collectively establish the CI-SHAP-NA framework as a transformative tool for soil health assessment, offering substantial advantages over conventional approaches in terms of analytical robustness, ecological relevance, and practical utility. Future research should aim to incorporate multi-functional indicators as well as evaluate the framework’s performance across varied agroecosystems.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 111011"},"PeriodicalIF":8.9000,"publicationDate":"2025-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925011172","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate and robust soil health assessment is essential for sustaining legume-based rotation systems and informing their optimized management. To address the limitations of conventional methods in capturing management-induced variations, we developed an innovative framework grounded in the theoretical hypothesis that soil health reflects soil’s capacity to maximize production stability while minimizing input requirements. This framework synergistically integrates interpretable machine learning with causal inference and network analysis (CI-SHAP-NA), implementing a systematic workflow encompassing indicator selection, quantitative scoring, and multidimensional integration. Our framework was systematically implemented to assess soil health across diverse legume-based rotation systems in China. The results showed that CI-SHAP-NA identified a parsimonious yet highly informative set of indicators (soil organic carbon, available iron, and cellobiohydrolase) demonstrating superior explanatory power for critical soil ecological processes. The derived soil health index (SHI) by the CI-SHAP-NA framework demonstrated enhanced discriminative capacity (SHI range: 0.01−0.92) and strong concordance (R2 = 0.80) with conventional total dataset assessment while maintaining significant predictive validity for crop productivity (Pearson’s r = 0.21, p < 0.001). It consistently outperformed PCA and NA methods in both explanatory power and fairness comparisons. The selected indicators proved robust and non-redundant, as substituting any indicator significantly reduced the correlation and sensitivity of SHI. Furthermore, CI-SHAP-NA demonstrated strong transferability, showing a stronger correlation with yield (r = 0.25, p < 0.001) on internally established independent sites than PCA and NA. This framework successfully resolved previously obscured soil health gradients between contrasting management systems, with paddy-legume rotations consistently outperforming their dryland counterparts − a differentiation rigorously validated against traditional benchmarks. These findings collectively establish the CI-SHAP-NA framework as a transformative tool for soil health assessment, offering substantial advantages over conventional approaches in terms of analytical robustness, ecological relevance, and practical utility. Future research should aim to incorporate multi-functional indicators as well as evaluate the framework’s performance across varied agroecosystems.
期刊介绍:
Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.