A novel method for leakage monitoring in Network-Level urban medium- and Low-Pressure natural gas pipelines combining information theory and Light Gradient Boosting
IF 8 1区 工程技术Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Zhengrun Huang , Xinming Qian , Pengliang Li , Xingyu Shen , Longfei Hou , Yuanzhi Li , Mengqi Yuan
{"title":"A novel method for leakage monitoring in Network-Level urban medium- and Low-Pressure natural gas pipelines combining information theory and Light Gradient Boosting","authors":"Zhengrun Huang , Xinming Qian , Pengliang Li , Xingyu Shen , Longfei Hou , Yuanzhi Li , Mengqi Yuan","doi":"10.1016/j.aei.2025.103309","DOIUrl":null,"url":null,"abstract":"<div><div>Leaks in urban medium- and low-pressure natural gas pipeline networks pose substantial risks and detection difficulties, compromising pipeline network reliability and urban safety. Little research has been conducted on leakage monitoring of network-level urban pipelines. This paper proposes a machine learning framework based on a dataset consisting of 141,236 samples collected over nearly five years. The framework real-time classifies the causes of anomalous signals (with leaks being one of the causes) collected by IoT terminals located near each section of pipelines within a large network at both the individual-sampling-point level and entire-incident level, thus enabling monitoring. Feature extraction is a crucial part of machine learning, relying solely on the sensor data and computational rules defined through diffusion laws and data analysis, requiring no prior information. The original feature set includes 360 features with physical significance. A novel iMICRS reduction method integrating information theory and rough set theory is developed to determine the optimal feature combination. Combining ten-fold cross-validation and Bayesian optimization, LightGBM achieves the highest ROC_AUC of 0.871 on a test set covering 9,885 sampling points. SHAP is used for prediction interpretation. The classification method for the entire incident based on the prediction results achieves a recall rate of 90% in diagnosing leaks (including multiple small leaks). This study provides an effective full-process engineering solution for leakage monitoring in urban medium- and low-pressure pipeline networks, based on actual operational data.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103309"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002022","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Leaks in urban medium- and low-pressure natural gas pipeline networks pose substantial risks and detection difficulties, compromising pipeline network reliability and urban safety. Little research has been conducted on leakage monitoring of network-level urban pipelines. This paper proposes a machine learning framework based on a dataset consisting of 141,236 samples collected over nearly five years. The framework real-time classifies the causes of anomalous signals (with leaks being one of the causes) collected by IoT terminals located near each section of pipelines within a large network at both the individual-sampling-point level and entire-incident level, thus enabling monitoring. Feature extraction is a crucial part of machine learning, relying solely on the sensor data and computational rules defined through diffusion laws and data analysis, requiring no prior information. The original feature set includes 360 features with physical significance. A novel iMICRS reduction method integrating information theory and rough set theory is developed to determine the optimal feature combination. Combining ten-fold cross-validation and Bayesian optimization, LightGBM achieves the highest ROC_AUC of 0.871 on a test set covering 9,885 sampling points. SHAP is used for prediction interpretation. The classification method for the entire incident based on the prediction results achieves a recall rate of 90% in diagnosing leaks (including multiple small leaks). This study provides an effective full-process engineering solution for leakage monitoring in urban medium- and low-pressure pipeline networks, based on actual operational data.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.