{"title":"Bayesian-optimized recursive machine learning for predicting human-induced changes in suspended sediment transport","authors":"Soumya Kundu, Somil Swarnkar, Akshay Agarwal","doi":"10.1007/s10661-025-14039-w","DOIUrl":null,"url":null,"abstract":"<div><p>The suspended sediment load (<i>SSL</i>) of a river is a key indicator of water resource management, river morphology, and ecosystem health. This study analyzes historical changes in <i>SSL</i> and evaluates machine learning (ML) models for <i>SSL</i> prediction in the Godavari River Basin. The dataset was divided into pre-1990 (1969–1990) and post-1990 (1990–2020) periods, revealing a significant decline in mean annual <i>SSL</i> from 136.85 to 62.38 million tons post-1990 due to anthropogenic influences such as dam construction and land-use/land-cover (LULC) changes. Despite a consistent seasonal distribution (~ 73% <i>SSL</i> contribution from monsoon months in both periods), there was a notable decline in median and peak <i>SSL</i> values, along with a narrowing interquartile range, indicating reduced sediment availability. The empirical cumulative distribution function (ECDF) further revealed shifts in sediment transport, with post-1990 <i>SSL</i> values surpassing pre-1990 levels at higher cumulative probabilities, suggesting altered sediment retention and release patterns. To improve <i>SSL</i> prediction, tree-based ML models were developed and evaluated using <i>R</i><sup>2</sup>, <i>RMSE</i>, and <i>MAE</i> metrics. Among them, the extra trees regressor (ETR) demonstrated the highest predictive accuracy (<i>R</i><sup>2</sup> = 0.97 in training, 0.9 in testing) with the lowest errors, while the random forest regressor (RFR) and gradient boosting regressor (GBR) provided competitive results. The findings highlight the impact of human modifications on sediment transport and emphasize that ensemble tree-based models offer a robust solution for <i>SSL</i> prediction. This study provides valuable insights for river basin management and sustainable sediment transport modeling under changing hydrological conditions.</p></div>","PeriodicalId":544,"journal":{"name":"Environmental Monitoring and Assessment","volume":"197 5","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-04-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Monitoring and Assessment","FirstCategoryId":"93","ListUrlMain":"https://link.springer.com/article/10.1007/s10661-025-14039-w","RegionNum":4,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The suspended sediment load (SSL) of a river is a key indicator of water resource management, river morphology, and ecosystem health. This study analyzes historical changes in SSL and evaluates machine learning (ML) models for SSL prediction in the Godavari River Basin. The dataset was divided into pre-1990 (1969–1990) and post-1990 (1990–2020) periods, revealing a significant decline in mean annual SSL from 136.85 to 62.38 million tons post-1990 due to anthropogenic influences such as dam construction and land-use/land-cover (LULC) changes. Despite a consistent seasonal distribution (~ 73% SSL contribution from monsoon months in both periods), there was a notable decline in median and peak SSL values, along with a narrowing interquartile range, indicating reduced sediment availability. The empirical cumulative distribution function (ECDF) further revealed shifts in sediment transport, with post-1990 SSL values surpassing pre-1990 levels at higher cumulative probabilities, suggesting altered sediment retention and release patterns. To improve SSL prediction, tree-based ML models were developed and evaluated using R2, RMSE, and MAE metrics. Among them, the extra trees regressor (ETR) demonstrated the highest predictive accuracy (R2 = 0.97 in training, 0.9 in testing) with the lowest errors, while the random forest regressor (RFR) and gradient boosting regressor (GBR) provided competitive results. The findings highlight the impact of human modifications on sediment transport and emphasize that ensemble tree-based models offer a robust solution for SSL prediction. This study provides valuable insights for river basin management and sustainable sediment transport modeling under changing hydrological conditions.
期刊介绍:
Environmental Monitoring and Assessment emphasizes technical developments and data arising from environmental monitoring and assessment, the use of scientific principles in the design of monitoring systems at the local, regional and global scales, and the use of monitoring data in assessing the consequences of natural resource management actions and pollution risks to man and the environment.