{"title":"Multi-step time-to-failure predictions in water pipelines using feature engineering and cascading ensembles","authors":"Beenish Bakhtawar, Tarek Zayed, Husnain Arshad","doi":"10.1016/j.watres.2025.124253","DOIUrl":null,"url":null,"abstract":"<div><div>Predicting failure timings in water pipelines is crucial for actionable predictive maintenance and rehabilitation planning of water distribution networks. However, existing time-to-failure prediction models have limited capability to incorporate failure history and determine sequential failures in individual pipeline sections. Secondly, accuracy of these models is hampered by lack of in-depth investigation and selection of most significant predictors of failure timings from historical data. As dynamic features can better determine time-based deterioration impacts, the study develops a customized weather index, and other interaction features for accuracy enhancements. Furthermore, feature selection is further automated for optimized performance with MAE ranges:1.4–0.5 for the developed models. Overall, GA-based feature selection and feature engineering results in a 20–50 % increase in the model performance, with highest reported performance when compared with existing models. Finally, a cascading ensemble for predicting first, second and third failure of individual pipelines is proposed, tested and validated using both hold-out and out-of-sample testing, exhibiting higher performance (MAE:0.8–1.1) than alternative multi-output models. Demonstrated using a web-based application, the developed study offers a novel modeling regime for high performance failure timings prediction of water pipelines, offering a micro-level analyses of pipe sections, giving useful insights into the complex interactions of features for indirectly gauging deterioration rate in water networks.</div></div>","PeriodicalId":443,"journal":{"name":"Water Research","volume":"286 ","pages":"Article 124253"},"PeriodicalIF":12.4000,"publicationDate":"2025-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Water Research","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0043135425011595","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Predicting failure timings in water pipelines is crucial for actionable predictive maintenance and rehabilitation planning of water distribution networks. However, existing time-to-failure prediction models have limited capability to incorporate failure history and determine sequential failures in individual pipeline sections. Secondly, accuracy of these models is hampered by lack of in-depth investigation and selection of most significant predictors of failure timings from historical data. As dynamic features can better determine time-based deterioration impacts, the study develops a customized weather index, and other interaction features for accuracy enhancements. Furthermore, feature selection is further automated for optimized performance with MAE ranges:1.4–0.5 for the developed models. Overall, GA-based feature selection and feature engineering results in a 20–50 % increase in the model performance, with highest reported performance when compared with existing models. Finally, a cascading ensemble for predicting first, second and third failure of individual pipelines is proposed, tested and validated using both hold-out and out-of-sample testing, exhibiting higher performance (MAE:0.8–1.1) than alternative multi-output models. Demonstrated using a web-based application, the developed study offers a novel modeling regime for high performance failure timings prediction of water pipelines, offering a micro-level analyses of pipe sections, giving useful insights into the complex interactions of features for indirectly gauging deterioration rate in water networks.
期刊介绍:
Water Research, along with its open access companion journal Water Research X, serves as a platform for publishing original research papers covering various aspects of the science and technology related to the anthropogenic water cycle, water quality, and its management worldwide. The audience targeted by the journal comprises biologists, chemical engineers, chemists, civil engineers, environmental engineers, limnologists, and microbiologists. The scope of the journal include:
•Treatment processes for water and wastewaters (municipal, agricultural, industrial, and on-site treatment), including resource recovery and residuals management;
•Urban hydrology including sewer systems, stormwater management, and green infrastructure;
•Drinking water treatment and distribution;
•Potable and non-potable water reuse;
•Sanitation, public health, and risk assessment;
•Anaerobic digestion, solid and hazardous waste management, including source characterization and the effects and control of leachates and gaseous emissions;
•Contaminants (chemical, microbial, anthropogenic particles such as nanoparticles or microplastics) and related water quality sensing, monitoring, fate, and assessment;
•Anthropogenic impacts on inland, tidal, coastal and urban waters, focusing on surface and ground waters, and point and non-point sources of pollution;
•Environmental restoration, linked to surface water, groundwater and groundwater remediation;
•Analysis of the interfaces between sediments and water, and between water and atmosphere, focusing specifically on anthropogenic impacts;
•Mathematical modelling, systems analysis, machine learning, and beneficial use of big data related to the anthropogenic water cycle;
•Socio-economic, policy, and regulations studies.