Xinyu Chang , Jun Guo , Yi Liu , Xiangqian Wei , Xinying Wang , Hui Qin
{"title":"Study on runoff forecasting and error correction driven by atmosphere–ocean-land dataset","authors":"Xinyu Chang , Jun Guo , Yi Liu , Xiangqian Wei , Xinying Wang , Hui Qin","doi":"10.1016/j.eswa.2024.125744","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate runoff forecasting results can not only provide an important basis for flood control scheduling, but also provide scientific support for water resources optimization, which promotes the maximization of the overall benefits of the basin. To further explore the inherent mechanisms of the atmosphere–ocean-land factors driving runoff changes, this study proposes the factors dimension reduction and interpretation framework based on Pearson, eXtreme Gradient Boosting and SHapley Additive exPlanations (P-XGBoost-SHAP). Base on this, the Gaussian Process Regression (GPR), Long Short-Term Memory neural network (LSTM) and Support Vector Machine (SVM) models are used to construct the atmospheric-ocean-land data-driven runoff prediction model. Meanwhile, for the runoff prediction residuals, this paper proposes an error multi-step correction framework based on ensemble empirical mode decomposition and autoregressive model (EEMD-AR). The case study of Lianghekou hydrological station shows that the factor dimension reduction and interpretation framework can greatly reduce the input dimension of the model, and explain the factors globally and locally by using SHAP Value. Compared with the traditional Random Forest (RF) dimension reduction method, it shows higher prediction accuracy. The Nash-Sutcliffe efficiency coefficient (<em>NSE</em>) can be increased to about 0.93, which is 4.91 % and 1.97 % higher than the series–parallel coupling (AR-Parallel) and empirical mode decomposition-autoregressive (EMD-AR) correction methods, respectively. The accuracy of the runoff forecasting prediction is improved while reducing the input dimension of the model.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"263 ","pages":"Article 125744"},"PeriodicalIF":7.5000,"publicationDate":"2024-11-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417424026113","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate runoff forecasting results can not only provide an important basis for flood control scheduling, but also provide scientific support for water resources optimization, which promotes the maximization of the overall benefits of the basin. To further explore the inherent mechanisms of the atmosphere–ocean-land factors driving runoff changes, this study proposes the factors dimension reduction and interpretation framework based on Pearson, eXtreme Gradient Boosting and SHapley Additive exPlanations (P-XGBoost-SHAP). Base on this, the Gaussian Process Regression (GPR), Long Short-Term Memory neural network (LSTM) and Support Vector Machine (SVM) models are used to construct the atmospheric-ocean-land data-driven runoff prediction model. Meanwhile, for the runoff prediction residuals, this paper proposes an error multi-step correction framework based on ensemble empirical mode decomposition and autoregressive model (EEMD-AR). The case study of Lianghekou hydrological station shows that the factor dimension reduction and interpretation framework can greatly reduce the input dimension of the model, and explain the factors globally and locally by using SHAP Value. Compared with the traditional Random Forest (RF) dimension reduction method, it shows higher prediction accuracy. The Nash-Sutcliffe efficiency coefficient (NSE) can be increased to about 0.93, which is 4.91 % and 1.97 % higher than the series–parallel coupling (AR-Parallel) and empirical mode decomposition-autoregressive (EMD-AR) correction methods, respectively. The accuracy of the runoff forecasting prediction is improved while reducing the input dimension of the model.
期刊介绍:
Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.