A novel method for leakage monitoring in Network-Level urban medium- and Low-Pressure natural gas pipelines combining information theory and Light Gradient Boosting

IF 8 1区 工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Zhengrun Huang , Xinming Qian , Pengliang Li , Xingyu Shen , Longfei Hou , Yuanzhi Li , Mengqi Yuan
{"title":"A novel method for leakage monitoring in Network-Level urban medium- and Low-Pressure natural gas pipelines combining information theory and Light Gradient Boosting","authors":"Zhengrun Huang ,&nbsp;Xinming Qian ,&nbsp;Pengliang Li ,&nbsp;Xingyu Shen ,&nbsp;Longfei Hou ,&nbsp;Yuanzhi Li ,&nbsp;Mengqi Yuan","doi":"10.1016/j.aei.2025.103309","DOIUrl":null,"url":null,"abstract":"<div><div>Leaks in urban medium- and low-pressure natural gas pipeline networks pose substantial risks and detection difficulties, compromising pipeline network reliability and urban safety. Little research has been conducted on leakage monitoring of network-level urban pipelines. This paper proposes a machine learning framework based on a dataset consisting of 141,236 samples collected over nearly five years. The framework real-time classifies the causes of anomalous signals (with leaks being one of the causes) collected by IoT terminals located near each section of pipelines within a large network at both the individual-sampling-point level and entire-incident level, thus enabling monitoring. Feature extraction is a crucial part of machine learning, relying solely on the sensor data and computational rules defined through diffusion laws and data analysis, requiring no prior information. The original feature set includes 360 features with physical significance. A novel iMICRS reduction method integrating information theory and rough set theory is developed to determine the optimal feature combination. Combining ten-fold cross-validation and Bayesian optimization, LightGBM achieves the highest ROC_AUC of 0.871 on a test set covering 9,885 sampling points. SHAP is used for prediction interpretation. The classification method for the entire incident based on the prediction results achieves a recall rate of 90% in diagnosing leaks (including multiple small leaks). This study provides an effective full-process engineering solution for leakage monitoring in urban medium- and low-pressure pipeline networks, based on actual operational data.</div></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"65 ","pages":"Article 103309"},"PeriodicalIF":8.0000,"publicationDate":"2025-04-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034625002022","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Leaks in urban medium- and low-pressure natural gas pipeline networks pose substantial risks and detection difficulties, compromising pipeline network reliability and urban safety. Little research has been conducted on leakage monitoring of network-level urban pipelines. This paper proposes a machine learning framework based on a dataset consisting of 141,236 samples collected over nearly five years. The framework real-time classifies the causes of anomalous signals (with leaks being one of the causes) collected by IoT terminals located near each section of pipelines within a large network at both the individual-sampling-point level and entire-incident level, thus enabling monitoring. Feature extraction is a crucial part of machine learning, relying solely on the sensor data and computational rules defined through diffusion laws and data analysis, requiring no prior information. The original feature set includes 360 features with physical significance. A novel iMICRS reduction method integrating information theory and rough set theory is developed to determine the optimal feature combination. Combining ten-fold cross-validation and Bayesian optimization, LightGBM achieves the highest ROC_AUC of 0.871 on a test set covering 9,885 sampling points. SHAP is used for prediction interpretation. The classification method for the entire incident based on the prediction results achieves a recall rate of 90% in diagnosing leaks (including multiple small leaks). This study provides an effective full-process engineering solution for leakage monitoring in urban medium- and low-pressure pipeline networks, based on actual operational data.
结合信息论与光梯度增强的管网级城市中低压天然气管道泄漏监测新方法
城市中低压天然气管网泄漏给管网可靠性和城市安全带来巨大风险和检测困难。对于管网级城市管道泄漏监测的研究很少。本文提出了一个基于近五年收集的141236个样本的数据集的机器学习框架。该框架实时对位于大型网络内每段管道附近的物联网终端收集的异常信号(泄漏是原因之一)进行分类,包括单个采样点级别和整个事件级别,从而实现监控。特征提取是机器学习的关键部分,完全依赖于传感器数据和通过扩散规律和数据分析定义的计算规则,不需要先验信息。最初的特性集包括360个具有物理意义的特性。结合信息理论和粗糙集理论,提出了一种新的iMICRS约简方法来确定最优特征组合。结合十倍交叉验证和贝叶斯优化,LightGBM在覆盖9885个采样点的测试集上达到了0.871的最高ROC_AUC。SHAP用于预测解释。基于预测结果对整个事件进行分类的方法对泄漏(包括多个小泄漏)的诊断召回率达到90%。本研究基于实际运行数据,为城市中低压管网泄漏监测提供了有效的全过程工程解决方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Advanced Engineering Informatics
Advanced Engineering Informatics 工程技术-工程:综合
CiteScore
12.40
自引率
18.20%
发文量
292
审稿时长
45 days
期刊介绍: Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信