Ali El Bilali, Youssef Brouziyne, Oumaima Attar, Houda Lamane, Abdessamad Hadri, Abdeslam Taleb
{"title":"Physics-informed machine learning algorithms for forecasting sediment yield: an analysis of physical consistency, sensitivity, and interpretability.","authors":"Ali El Bilali, Youssef Brouziyne, Oumaima Attar, Houda Lamane, Abdessamad Hadri, Abdeslam Taleb","doi":"10.1007/s11356-024-34245-2","DOIUrl":null,"url":null,"abstract":"<p><p>The sediment transport, involving the movement of the bedload and suspended sediment in the basins, is a critical environmental concern that worsens water scarcity and leads to degradation of land and its ecosystems. Machine learning (ML) algorithms have emerged as powerful tools for predicting sediment yield. However, their use by decision-makers can be attributed to concerns regarding their consistency with the involved physical processes. In light of this issue, this study aims to develop a physics-informed ML approach for predicting sediment yield. To achieve this objective, Gaussian, Center, Regular, and Direct Copulas were employed to generate virtual combinations of physical of the sub-basins and hydrological datasets. These datasets were then utilized to train deep neural network (DNN), conventional neural network (CNN), Extra Tree, and XGBoost (XGB) models. The performance of these models was compared with the modified universal soil loss equation (MUSLE), which serves as a process-based model. The results demonstrated that the ML models outperformed the MUSLE model, exhibiting improvements in Nash-Sutcliffe efficiency (NSE) of approximately 10%, 18%, 32%, and 41% for the DNN, CNN, Extra Tree, and XGB models, respectively. Furthermore, through Sobol sensitivity and Shapley additive explanation-based interpretability analyses, it was revealed that the Extra Tree model displayed greater consistency with the physical processes underlying sediment transport as modeled by MUSLE. The proposed framework provides new insights into enhancing the accuracy and applicability of ML models in forecasting sediment yield while maintaining consistency with natural processes. Consequently, it can prove valuable in simulating process-related strategies aimed at mitigating sediment transport at watershed scales, such as the implementation of best management practices.</p>","PeriodicalId":545,"journal":{"name":"Environmental Science and Pollution Research","volume":null,"pages":null},"PeriodicalIF":5.8000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science and Pollution Research","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1007/s11356-024-34245-2","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/11 0:00:00","PubModel":"Epub","JCR":"N/A","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The sediment transport, involving the movement of the bedload and suspended sediment in the basins, is a critical environmental concern that worsens water scarcity and leads to degradation of land and its ecosystems. Machine learning (ML) algorithms have emerged as powerful tools for predicting sediment yield. However, their use by decision-makers can be attributed to concerns regarding their consistency with the involved physical processes. In light of this issue, this study aims to develop a physics-informed ML approach for predicting sediment yield. To achieve this objective, Gaussian, Center, Regular, and Direct Copulas were employed to generate virtual combinations of physical of the sub-basins and hydrological datasets. These datasets were then utilized to train deep neural network (DNN), conventional neural network (CNN), Extra Tree, and XGBoost (XGB) models. The performance of these models was compared with the modified universal soil loss equation (MUSLE), which serves as a process-based model. The results demonstrated that the ML models outperformed the MUSLE model, exhibiting improvements in Nash-Sutcliffe efficiency (NSE) of approximately 10%, 18%, 32%, and 41% for the DNN, CNN, Extra Tree, and XGB models, respectively. Furthermore, through Sobol sensitivity and Shapley additive explanation-based interpretability analyses, it was revealed that the Extra Tree model displayed greater consistency with the physical processes underlying sediment transport as modeled by MUSLE. The proposed framework provides new insights into enhancing the accuracy and applicability of ML models in forecasting sediment yield while maintaining consistency with natural processes. Consequently, it can prove valuable in simulating process-related strategies aimed at mitigating sediment transport at watershed scales, such as the implementation of best management practices.
期刊介绍:
Environmental Science and Pollution Research (ESPR) serves the international community in all areas of Environmental Science and related subjects with emphasis on chemical compounds. This includes:
- Terrestrial Biology and Ecology
- Aquatic Biology and Ecology
- Atmospheric Chemistry
- Environmental Microbiology/Biobased Energy Sources
- Phytoremediation and Ecosystem Restoration
- Environmental Analyses and Monitoring
- Assessment of Risks and Interactions of Pollutants in the Environment
- Conservation Biology and Sustainable Agriculture
- Impact of Chemicals/Pollutants on Human and Animal Health
It reports from a broad interdisciplinary outlook.