Toward Trustworthy Machine Learning for Daily Sediment Modeling in the Riverine Systems: An Integrated Framework With Enhanced Uncertainty Quantification and Interpretability
Z. J. Yue, N. N. Wang, B. D. Xu, X. Huang, D. M. Yang, H. B. Xiao, Z. H. Shi
{"title":"Toward Trustworthy Machine Learning for Daily Sediment Modeling in the Riverine Systems: An Integrated Framework With Enhanced Uncertainty Quantification and Interpretability","authors":"Z. J. Yue, N. N. Wang, B. D. Xu, X. Huang, D. M. Yang, H. B. Xiao, Z. H. Shi","doi":"10.1029/2024wr038650","DOIUrl":null,"url":null,"abstract":"Accurately predicting sediment dynamics and understanding their intrinsic contributors are pivotal for sustainable environment and water management. While machine learning (ML) enables precise predictions, its “black-box” nature hinders transparency and credibility, posing challenges in interpretability and uncertainty quantification (UQ). To achieve trustworthy ML for riverine sediment timeseries predictions, this study proposes an integrated ML framework, enhancing key steps: feature selection, UQ, and interpretation. Lagged hydro-environmental variables are incorporated via rigorous feature selection. SHapley Additive exPlanations (SHAP) and conformal prediction are utilized to refine interpretability and UQ, respectively. Based on 41-year multi-source data and three ensemble learning algorithms (LightGBM, XGBoost, and random forest (RF)), this study models daily suspended sediment concentration (SSC) separately for seven subtropical watersheds and evaluates overall and local accuracy. Key findings include: (a) Discharge and precipitation dominate SSC variability (explaining ∼56.8% and ∼18.9% of the variability, respectively). Sampling-day discharge and accumulative lagged precipitation should be prioritized as predictors. Precipitation-discharge interaction effects on SSC exhibit simple threshold effects, whereas the interaction effects of hydrological (precipitation, discharge) and environmental (SPEI, land cover) factors involve complex, bidirectional threshold effects. (b) LightGBM and XGBoost excel in long-term/general prediction, while RF outperform for short-term/extreme value predictions. (c) Conformal prediction-based UQ provides probabilistic information to quantify prediction reliability and efficiency, alongside uncertainty sources: discharge (∼38.9%) > precipitation (∼33.4%) > land cover (∼19.6%) > SPEI (∼8.1%). This framework advances trustworthy ML in riverine sediment modeling, while its algorithm-agnostic design ensures potential scalability to support broader hydrological applications and informed environmental decision-making.","PeriodicalId":23799,"journal":{"name":"Water Resources Research","volume":"77 1","pages":""},"PeriodicalIF":4.6000,"publicationDate":"2025-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Research","FirstCategoryId":"89","ListUrlMain":"https://doi.org/10.1029/2024wr038650","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Accurately predicting sediment dynamics and understanding their intrinsic contributors are pivotal for sustainable environment and water management. While machine learning (ML) enables precise predictions, its “black-box” nature hinders transparency and credibility, posing challenges in interpretability and uncertainty quantification (UQ). To achieve trustworthy ML for riverine sediment timeseries predictions, this study proposes an integrated ML framework, enhancing key steps: feature selection, UQ, and interpretation. Lagged hydro-environmental variables are incorporated via rigorous feature selection. SHapley Additive exPlanations (SHAP) and conformal prediction are utilized to refine interpretability and UQ, respectively. Based on 41-year multi-source data and three ensemble learning algorithms (LightGBM, XGBoost, and random forest (RF)), this study models daily suspended sediment concentration (SSC) separately for seven subtropical watersheds and evaluates overall and local accuracy. Key findings include: (a) Discharge and precipitation dominate SSC variability (explaining ∼56.8% and ∼18.9% of the variability, respectively). Sampling-day discharge and accumulative lagged precipitation should be prioritized as predictors. Precipitation-discharge interaction effects on SSC exhibit simple threshold effects, whereas the interaction effects of hydrological (precipitation, discharge) and environmental (SPEI, land cover) factors involve complex, bidirectional threshold effects. (b) LightGBM and XGBoost excel in long-term/general prediction, while RF outperform for short-term/extreme value predictions. (c) Conformal prediction-based UQ provides probabilistic information to quantify prediction reliability and efficiency, alongside uncertainty sources: discharge (∼38.9%) > precipitation (∼33.4%) > land cover (∼19.6%) > SPEI (∼8.1%). This framework advances trustworthy ML in riverine sediment modeling, while its algorithm-agnostic design ensures potential scalability to support broader hydrological applications and informed environmental decision-making.
期刊介绍:
Water Resources Research (WRR) is an interdisciplinary journal that focuses on hydrology and water resources. It publishes original research in the natural and social sciences of water. It emphasizes the role of water in the Earth system, including physical, chemical, biological, and ecological processes in water resources research and management, including social, policy, and public health implications. It encompasses observational, experimental, theoretical, analytical, numerical, and data-driven approaches that advance the science of water and its management. Submissions are evaluated for their novelty, accuracy, significance, and broader implications of the findings.