Xuke Wu , Kun Shan , Friedrich Recknagel , Lan Wang , Mingsheng Shang
{"title":"Enhanced tensor factorization for spatiotemporal imputation of high-frequency water quality monitoring data","authors":"Xuke Wu , Kun Shan , Friedrich Recknagel , Lan Wang , Mingsheng Shang","doi":"10.1016/j.envsoft.2025.106667","DOIUrl":null,"url":null,"abstract":"<div><div>Automatic high-frequency monitoring (AHFM) of water quality is increasingly being deployed to enhance the management and scientific understanding of eutrophic lakes. However, the prevalence of missing data in such systems poses significant challenges, potentially compromising decision-making processes and distorting analytical or modelling outcomes. This study proposes an enhanced tensor factorization model, termed the Diverse Biases-integrated Adaptive Latent-factorization-of-tensors (DBAL), which decomposes high-dimensional data into low-rank components while incorporating diverse linear biases to capture temporal fluctuations and employing a differential evolution algorithm for adaptive hyperparameter optimization. Extensive empirical validation using real-world AHFM datasets from a large eutrophic lake in China demonstrates that our proposed DBAL consistently outperforms the state-of-the-art models, achieving 5.6 %–50.3 % and 5.7 %–43.7 % reductions in RMSE and MAE, alongside a 0.3 %–6.1 % improvement in R<sup>2</sup>. Notably, DBAL demonstrates variable-specific performance, with particularly accurate imputation for stable physical parameters while revealing greater challenges for biologically-active variables that exhibit stronger temporal dynamics.</div></div>","PeriodicalId":310,"journal":{"name":"Environmental Modelling & Software","volume":"193 ","pages":"Article 106667"},"PeriodicalIF":4.6000,"publicationDate":"2025-08-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Modelling & Software","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1364815225003512","RegionNum":2,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Automatic high-frequency monitoring (AHFM) of water quality is increasingly being deployed to enhance the management and scientific understanding of eutrophic lakes. However, the prevalence of missing data in such systems poses significant challenges, potentially compromising decision-making processes and distorting analytical or modelling outcomes. This study proposes an enhanced tensor factorization model, termed the Diverse Biases-integrated Adaptive Latent-factorization-of-tensors (DBAL), which decomposes high-dimensional data into low-rank components while incorporating diverse linear biases to capture temporal fluctuations and employing a differential evolution algorithm for adaptive hyperparameter optimization. Extensive empirical validation using real-world AHFM datasets from a large eutrophic lake in China demonstrates that our proposed DBAL consistently outperforms the state-of-the-art models, achieving 5.6 %–50.3 % and 5.7 %–43.7 % reductions in RMSE and MAE, alongside a 0.3 %–6.1 % improvement in R2. Notably, DBAL demonstrates variable-specific performance, with particularly accurate imputation for stable physical parameters while revealing greater challenges for biologically-active variables that exhibit stronger temporal dynamics.
期刊介绍:
Environmental Modelling & Software publishes contributions, in the form of research articles, reviews and short communications, on recent advances in environmental modelling and/or software. The aim is to improve our capacity to represent, understand, predict or manage the behaviour of environmental systems at all practical scales, and to communicate those improvements to a wide scientific and professional audience.