{"title":"从数据到决策:利用 ML 改进孟加拉国的河流泄量预报","authors":"Md. Abu Saleh, H.M. Rasel, Briti Ray","doi":"10.1016/j.wsee.2024.09.004","DOIUrl":null,"url":null,"abstract":"<div><div>River discharge forecasting stands at the forefront of environmental management, contributing significantly to sustainable development through its impact on flood prevention, water resource management, ecological conservation, and energy production. This study forecasted the annual river discharge forecasting in the Nilphamari district of Bangladesh, employing random forest (RF), support vector machine (SVM), and gradient boosting machine (GBM) techniques. Historical river discharge data spanning from 1990 to 2020, obtained from eight surface water stations, forms the basis of the analysis. The forecast was performed from 2021 to 2030. 11 statistical parameters were considered for performance evaluation. Additionally, four evaluation plots, comprising a quantile–quantile plot (QQ plot), a residual plot, a Bland Altman plot, and Theil’s U statistic, were employed for a detailed understanding of model accuracy. Results demonstrate that the random forest regression technique exhibited superior accuracy compared to SVM and GBM in training and testing stages. Notably, the coefficient of determination reached 97 % during the testing phase, emphasizing the robustness of this model. While Mean Absolute Error is lower (1085.071 cubic meter per second), in training, the model captures relative changes (Mean Absolute Percentage Error = 0.154) better during prediction. Willmott’s Index in training (0.77) and testing (0.55) suggest the model memorizes training data well and outperforms the other models in testing stage. The findings underscore the efficacy of RF regression as a superior alternative for short-term discharge forecasting, offering valuable insights for integrated water resources management, particularly in flood warning systems and the expansion of irrigation initiatives.</div></div>","PeriodicalId":101280,"journal":{"name":"Watershed Ecology and the Environment","volume":"6 ","pages":"Pages 209-226"},"PeriodicalIF":0.0000,"publicationDate":"2024-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"From data to decisions: Leveraging ML for improved river discharge forecasting in Bangladesh\",\"authors\":\"Md. Abu Saleh, H.M. Rasel, Briti Ray\",\"doi\":\"10.1016/j.wsee.2024.09.004\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>River discharge forecasting stands at the forefront of environmental management, contributing significantly to sustainable development through its impact on flood prevention, water resource management, ecological conservation, and energy production. This study forecasted the annual river discharge forecasting in the Nilphamari district of Bangladesh, employing random forest (RF), support vector machine (SVM), and gradient boosting machine (GBM) techniques. Historical river discharge data spanning from 1990 to 2020, obtained from eight surface water stations, forms the basis of the analysis. The forecast was performed from 2021 to 2030. 11 statistical parameters were considered for performance evaluation. Additionally, four evaluation plots, comprising a quantile–quantile plot (QQ plot), a residual plot, a Bland Altman plot, and Theil’s U statistic, were employed for a detailed understanding of model accuracy. Results demonstrate that the random forest regression technique exhibited superior accuracy compared to SVM and GBM in training and testing stages. Notably, the coefficient of determination reached 97 % during the testing phase, emphasizing the robustness of this model. While Mean Absolute Error is lower (1085.071 cubic meter per second), in training, the model captures relative changes (Mean Absolute Percentage Error = 0.154) better during prediction. Willmott’s Index in training (0.77) and testing (0.55) suggest the model memorizes training data well and outperforms the other models in testing stage. The findings underscore the efficacy of RF regression as a superior alternative for short-term discharge forecasting, offering valuable insights for integrated water resources management, particularly in flood warning systems and the expansion of irrigation initiatives.</div></div>\",\"PeriodicalId\":101280,\"journal\":{\"name\":\"Watershed Ecology and the Environment\",\"volume\":\"6 \",\"pages\":\"Pages 209-226\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Watershed Ecology and the Environment\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2589471424000172\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Watershed Ecology and the Environment","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2589471424000172","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
From data to decisions: Leveraging ML for improved river discharge forecasting in Bangladesh
River discharge forecasting stands at the forefront of environmental management, contributing significantly to sustainable development through its impact on flood prevention, water resource management, ecological conservation, and energy production. This study forecasted the annual river discharge forecasting in the Nilphamari district of Bangladesh, employing random forest (RF), support vector machine (SVM), and gradient boosting machine (GBM) techniques. Historical river discharge data spanning from 1990 to 2020, obtained from eight surface water stations, forms the basis of the analysis. The forecast was performed from 2021 to 2030. 11 statistical parameters were considered for performance evaluation. Additionally, four evaluation plots, comprising a quantile–quantile plot (QQ plot), a residual plot, a Bland Altman plot, and Theil’s U statistic, were employed for a detailed understanding of model accuracy. Results demonstrate that the random forest regression technique exhibited superior accuracy compared to SVM and GBM in training and testing stages. Notably, the coefficient of determination reached 97 % during the testing phase, emphasizing the robustness of this model. While Mean Absolute Error is lower (1085.071 cubic meter per second), in training, the model captures relative changes (Mean Absolute Percentage Error = 0.154) better during prediction. Willmott’s Index in training (0.77) and testing (0.55) suggest the model memorizes training data well and outperforms the other models in testing stage. The findings underscore the efficacy of RF regression as a superior alternative for short-term discharge forecasting, offering valuable insights for integrated water resources management, particularly in flood warning systems and the expansion of irrigation initiatives.