{"title":"Improving Linear Interpolation of Missing Hydrological Data by Applying Integrated Autoregressive Models","authors":"Tomasz Niedzielski, Michał Halicki","doi":"10.1007/s11269-023-03625-7","DOIUrl":null,"url":null,"abstract":"Abstract The application of linear interpolation for handling missing hydrological data is unequivocal. On one hand, such an approach offers good reconstruction in the vicinity of last observation before a no-data gap and first measurement after the gap. On the other hand, it omits irregular variability of hydrological data. Such an irregularity can be described by time series models, such as for instance the autoregressive integrated moving average (ARIMA) model. Herein, we propose a method which combines linear interpolation with autoregressive integrated model (ARI, i.e. ARIMA without a moving average part), named LinAR (available at GitHub), as a tool for inputing hydrological data. Linear interpolation is combined with the ARI model through linear scaling the ARI-based prediction issued for the no-data gap. Such an approach contributes to the current state of art in gap-filling methods since it removes artificial jumps between last stochastic prediction and first known observation after the gap, also introducing some irregular variability in the first part of the no-data gap. The LinAR method is applied and evaluated on hourly water level data collected between 2016 and 2021 (52,608 hourly steps) from 28 gauges strategically located within the Odra/Oder River basin in southwestern and western Poland. The data was sourced from Institute of Meteorology and Water Management (Poland). Evaluating the performance with over 100 million assessments in the validation experiment, the study demonstrates that the LinAR approach outperforms the purely linear method, especially for short no-data gaps (up to 12 hourly steps) and for rivers of considerable size. Based on rigorous statistical analysis of root mean square error (RMSE) – expressed (1) absolutely, (2) as percentages and (3) using RMSE error bars – the percentage improvement, understood as percentage difference between RMSE of linear and LinAR interpolations, was found to reach up to 10%.","PeriodicalId":23611,"journal":{"name":"Water Resources Management","volume":"21 1","pages":"0"},"PeriodicalIF":3.9000,"publicationDate":"2023-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Resources Management","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s11269-023-03625-7","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, CIVIL","Score":null,"Total":0}
引用次数: 0
Abstract
Abstract The application of linear interpolation for handling missing hydrological data is unequivocal. On one hand, such an approach offers good reconstruction in the vicinity of last observation before a no-data gap and first measurement after the gap. On the other hand, it omits irregular variability of hydrological data. Such an irregularity can be described by time series models, such as for instance the autoregressive integrated moving average (ARIMA) model. Herein, we propose a method which combines linear interpolation with autoregressive integrated model (ARI, i.e. ARIMA without a moving average part), named LinAR (available at GitHub), as a tool for inputing hydrological data. Linear interpolation is combined with the ARI model through linear scaling the ARI-based prediction issued for the no-data gap. Such an approach contributes to the current state of art in gap-filling methods since it removes artificial jumps between last stochastic prediction and first known observation after the gap, also introducing some irregular variability in the first part of the no-data gap. The LinAR method is applied and evaluated on hourly water level data collected between 2016 and 2021 (52,608 hourly steps) from 28 gauges strategically located within the Odra/Oder River basin in southwestern and western Poland. The data was sourced from Institute of Meteorology and Water Management (Poland). Evaluating the performance with over 100 million assessments in the validation experiment, the study demonstrates that the LinAR approach outperforms the purely linear method, especially for short no-data gaps (up to 12 hourly steps) and for rivers of considerable size. Based on rigorous statistical analysis of root mean square error (RMSE) – expressed (1) absolutely, (2) as percentages and (3) using RMSE error bars – the percentage improvement, understood as percentage difference between RMSE of linear and LinAR interpolations, was found to reach up to 10%.
期刊介绍:
Water Resources Management is an international, multidisciplinary forum for the publication of original contributions and the exchange of knowledge and experience on the management of water resources. In particular, the journal publishes contributions on water resources assessment, development, conservation and control, emphasizing policies and strategies. Contributions examine planning and design of water resource systems, and
operation, maintenance and administration of water resource systems.
Coverage extends to these closely related topics: water demand and consumption; applied surface and groundwater hydrology; water management techniques; simulation and modelling of water resource systems; forecasting and control of quantity and quality of water; economic and social aspects of water use; legislation and water resources protection.
Water Resources Management is supported scientifically by the European Water Resources Association, a scientific and technical nonprofit-making European association.