Combining wavelet-enhanced feature selection and deep learning techniques for multi-step forecasting of urban water demand

Environmental research: infrastructure and sustainability Pub Date : 2024-07-02 DOI:10.1088/2634-4505/ad5e1d

Wenjin Hao, Andrea Cominola, Andrea Castelletti

{"title":"Combining wavelet-enhanced feature selection and deep learning techniques for multi-step forecasting of urban water demand","authors":"Wenjin Hao, Andrea Cominola, Andrea Castelletti","doi":"10.1088/2634-4505/ad5e1d","DOIUrl":null,"url":null,"abstract":"\n Urban water demand (UWD) forecasting is essential for water supply network optimization and management, both in business-as-usual scenarios, as well as under external climate and socio-economic stressors. Different machine learning and deep learning models have shown promising forecasting skills in various areas of application. However, their potential to forecast multi-step ahead UWD has not been fully explored. Modelling uncertain UWD patterns and accounting for variations in water demand behaviors require techniques that can extract time-varying information and multi-scale changes. In this research, we comparatively investigate different state-of-the-art machine learning- and deep learning-based predictive models on 1-day- and 7-day-ahead UWD forecasting, using daily demand data from the city of Milan, Italy. The contribution of this paper is two-fold. First, we compare the forecasting performance of different machine learning and deep learning models on single- and multi-step daily UWD forecasting. These models include Artificial Neural Network (ANN), Support Vector Regression (SVR), Light Gradient Boosting Machine (LightGBM), and Long Short-Term Memory network with and without an attention mechanism (LSTM and AM-LSTM). We benchmark their prediction accuracy against autoregressive time series models. Second, we investigate the potential enhancement in predictive accuracy by incorporating the wavelet transform and feature selection performed by LightGBM into these models. Results show that, overall, wavelet-enhanced feature selection improves the model predictive performance. The hybrid model combining wavelet-enhanced feature selection via LightGBM with LSTM (WT-LightGBM-(AM)-LSTM) can achieve high levels of accuracy with Nash-Sutcliffe Efficiency larger than 0.95 and Kling–Gupta Efficiency higher than 0.93 for both 1-day- and 7-day-ahead UWD forecasts. Furthermore, performance is shown to be robust under the influence of external stressors causing sudden changes in UWD.","PeriodicalId":476263,"journal":{"name":"Environmental research: infrastructure and sustainability","volume":"9 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental research: infrastructure and sustainability","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.1088/2634-4505/ad5e1d","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Urban water demand (UWD) forecasting is essential for water supply network optimization and management, both in business-as-usual scenarios, as well as under external climate and socio-economic stressors. Different machine learning and deep learning models have shown promising forecasting skills in various areas of application. However, their potential to forecast multi-step ahead UWD has not been fully explored. Modelling uncertain UWD patterns and accounting for variations in water demand behaviors require techniques that can extract time-varying information and multi-scale changes. In this research, we comparatively investigate different state-of-the-art machine learning- and deep learning-based predictive models on 1-day- and 7-day-ahead UWD forecasting, using daily demand data from the city of Milan, Italy. The contribution of this paper is two-fold. First, we compare the forecasting performance of different machine learning and deep learning models on single- and multi-step daily UWD forecasting. These models include Artificial Neural Network (ANN), Support Vector Regression (SVR), Light Gradient Boosting Machine (LightGBM), and Long Short-Term Memory network with and without an attention mechanism (LSTM and AM-LSTM). We benchmark their prediction accuracy against autoregressive time series models. Second, we investigate the potential enhancement in predictive accuracy by incorporating the wavelet transform and feature selection performed by LightGBM into these models. Results show that, overall, wavelet-enhanced feature selection improves the model predictive performance. The hybrid model combining wavelet-enhanced feature selection via LightGBM with LSTM (WT-LightGBM-(AM)-LSTM) can achieve high levels of accuracy with Nash-Sutcliffe Efficiency larger than 0.95 and Kling–Gupta Efficiency higher than 0.93 for both 1-day- and 7-day-ahead UWD forecasts. Furthermore, performance is shown to be robust under the influence of external stressors causing sudden changes in UWD.

查看原文本刊更多论文

结合小波增强特征选择和深度学习技术，对城市用水需求进行多步骤预测

无论是在 "一切照旧 "的情况下，还是在外部气候和社会经济压力下，城市需水量（UWD）预测对于供水管网的优化和管理都至关重要。不同的机器学习和深度学习模型在不同的应用领域都显示出了良好的预测能力。然而，它们在预测多步超前用水量减少方面的潜力尚未得到充分挖掘。对不确定的超前用水模式进行建模并考虑需水行为的变化，需要能够提取时变信息和多尺度变化的技术。在这项研究中，我们利用意大利米兰市的每日需求数据，对基于机器学习和深度学习的不同先进预测模型进行了比较研究，以预测 1 天和 7 天的未用水量。本文有两方面的贡献。首先，我们比较了不同的机器学习和深度学习模型在单步和多步每日用水量预测中的预测性能。这些模型包括人工神经网络（ANN）、支持向量回归（SVR）、轻梯度提升机（LightGBM）以及带有和不带有注意力机制的长短期记忆网络（LSTM 和 AM-LSTM）。我们以自回归时间序列模型为基准来衡量它们的预测准确性。其次，我们研究了将 LightGBM 执行的小波变换和特征选择纳入这些模型后，预测准确性的潜在提升。结果表明，总体而言，小波增强特征选择提高了模型的预测性能。通过 LightGBM 和 LSTM（WT-LightGBM-(AM)-LSTM）将小波增强特征选择结合起来的混合模型可以达到很高的准确度，对于提前 1 天和 7 天的 UWD 预测，Nash-Sutcliffe 效率均大于 0.95，Kling-Gupta 效率均大于 0.93。此外，在导致 UWD 突然变化的外部压力因素影响下，其性能也很稳定。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Environmental research: infrastructure and sustainability

CiteScore

2.70

自引率

0.00%

发文量