{"title":"Comparison features importance for temporal and spatial-temporal approaches in GRACE and GRACE-FO signal reconstruction based on GLDAS data","authors":"Viktor Szabó","doi":"10.1504/ijhst.2023.134623","DOIUrl":null,"url":null,"abstract":"Machine learning algorithms can effectively learn the complex relationships between various input variables from the global land data assimilation system (GLDAS) and the total water storage (TWS) observed by gravity recovery and climate experiment (GRACE) and GRACE-FO (follow-on) missions. As the TWS depends on various features, a serious question arises about the importance of used variables for reconstruction. Furthermore, will the variables used for the reconstruction be equally significant for grid-based and basin-based analyses? This work examined the importance of individual predictors for the temporal and spatial-temporal approach over 254 river basins using GRACE and GRACE-FO data as target and GLDAS data as predictors. The extreme gradient boosting (XGBoost) algorithm was used to reconstruct TWS. Results were evaluated with root-mean-square error, normalised root-mean-square error, Pearson correlation coefficient, Nash-Sutcliffe efficiency, and Kolmogorov-Smirnow-test metrics. Model output influence was checked by the model-agnostic version of the feature importance and by Shapley additive explanations (SHAP).","PeriodicalId":0,"journal":{"name":"","volume":null,"pages":null},"PeriodicalIF":0.0,"publicationDate":"2023-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1504/ijhst.2023.134623","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning algorithms can effectively learn the complex relationships between various input variables from the global land data assimilation system (GLDAS) and the total water storage (TWS) observed by gravity recovery and climate experiment (GRACE) and GRACE-FO (follow-on) missions. As the TWS depends on various features, a serious question arises about the importance of used variables for reconstruction. Furthermore, will the variables used for the reconstruction be equally significant for grid-based and basin-based analyses? This work examined the importance of individual predictors for the temporal and spatial-temporal approach over 254 river basins using GRACE and GRACE-FO data as target and GLDAS data as predictors. The extreme gradient boosting (XGBoost) algorithm was used to reconstruct TWS. Results were evaluated with root-mean-square error, normalised root-mean-square error, Pearson correlation coefficient, Nash-Sutcliffe efficiency, and Kolmogorov-Smirnow-test metrics. Model output influence was checked by the model-agnostic version of the feature importance and by Shapley additive explanations (SHAP).