A self-supervised leak detection method for natural gas gathering pipelines considering unlabeled multi-class non-leak data

IF 8.2 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Zhonglin Zuo , Hao Zhang , Zheng Li , Li Ma , Shan Liang , Tong Liu , Mehmet Mercangöz
{"title":"A self-supervised leak detection method for natural gas gathering pipelines considering unlabeled multi-class non-leak data","authors":"Zhonglin Zuo ,&nbsp;Hao Zhang ,&nbsp;Zheng Li ,&nbsp;Li Ma ,&nbsp;Shan Liang ,&nbsp;Tong Liu ,&nbsp;Mehmet Mercangöz","doi":"10.1016/j.compind.2024.104102","DOIUrl":null,"url":null,"abstract":"<div><p>Detecting leaks in natural gas gathering pipelines is paramount for the safe and reliable operation of the gas and oil industry. Due to the lack of leak data and the changes in leak features, semi-supervised leak detection methods that use normal data for health model learning have attracted much attention. However, these approaches usually consider one-class normal samples as health data, which may fail to fit the reality of unlabeled multi-class non-leak data under variable operating conditions. In addition, existing semi-supervised methods often suffer from insufficient representation learning as they employ step-by-step training or rely on the low-level reconstruction of autoencoders. To address the above two key challenges, this paper proposes a novel end-to-end self-supervised leak detection method, self-supervised multi-sphere support vector data description. Specifically, it utilizes the presented multi-sphere support vector data description to model unlabeled multi-class non-leak data and the introduced self-supervised learning strategy to boost the representation learning of the end-to-end semi-supervised model. Moreover, the categories of unlabeled multi-class non-leak data are learned in an unsupervised way through alternating feature clustering and pseudo-label-based classification. A robust leak score calculation method is also designed to improve the performance of the proposed method. Finally, the experimental results on the field data collected from pipelines demonstrate the effectiveness of the proposed method.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"159 ","pages":"Article 104102"},"PeriodicalIF":8.2000,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000307","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

Detecting leaks in natural gas gathering pipelines is paramount for the safe and reliable operation of the gas and oil industry. Due to the lack of leak data and the changes in leak features, semi-supervised leak detection methods that use normal data for health model learning have attracted much attention. However, these approaches usually consider one-class normal samples as health data, which may fail to fit the reality of unlabeled multi-class non-leak data under variable operating conditions. In addition, existing semi-supervised methods often suffer from insufficient representation learning as they employ step-by-step training or rely on the low-level reconstruction of autoencoders. To address the above two key challenges, this paper proposes a novel end-to-end self-supervised leak detection method, self-supervised multi-sphere support vector data description. Specifically, it utilizes the presented multi-sphere support vector data description to model unlabeled multi-class non-leak data and the introduced self-supervised learning strategy to boost the representation learning of the end-to-end semi-supervised model. Moreover, the categories of unlabeled multi-class non-leak data are learned in an unsupervised way through alternating feature clustering and pseudo-label-based classification. A robust leak score calculation method is also designed to improve the performance of the proposed method. Finally, the experimental results on the field data collected from pipelines demonstrate the effectiveness of the proposed method.

考虑无标记多类非泄漏数据的天然气集输管道自监督泄漏检测方法
检测天然气集输管道的泄漏对天然气和石油工业的安全可靠运行至关重要。由于泄漏数据的缺乏和泄漏特征的变化,利用正常数据进行健康模型学习的半监督泄漏检测方法备受关注。然而,这些方法通常将单类正常样本视为健康数据,这可能无法适应多变运行条件下无标记多类非泄漏数据的实际情况。此外,现有的半监督方法往往存在表征学习不足的问题,因为它们采用逐步训练或依赖于自编码器的低级重构。针对上述两个关键挑战,本文提出了一种新颖的端到端自监督泄漏检测方法--自监督多球支持向量数据描述。具体来说,它利用提出的多球体支持向量数据描述对未标记的多类非泄漏数据进行建模,并利用引入的自监督学习策略促进端到端半监督模型的表示学习。此外,通过交替使用特征聚类和基于伪标签的分类,以无监督的方式学习了无标签多类非泄漏数据的类别。此外,还设计了一种稳健的泄漏分数计算方法,以提高所提方法的性能。最后,在管道采集的现场数据上的实验结果证明了所提方法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers in Industry
Computers in Industry 工程技术-计算机:跨学科应用
CiteScore
18.90
自引率
8.00%
发文量
152
审稿时长
22 days
期刊介绍: The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信