Zhonglin Zuo , Hao Zhang , Zheng Li , Li Ma , Shan Liang , Tong Liu , Mehmet Mercangöz
{"title":"A self-supervised leak detection method for natural gas gathering pipelines considering unlabeled multi-class non-leak data","authors":"Zhonglin Zuo , Hao Zhang , Zheng Li , Li Ma , Shan Liang , Tong Liu , Mehmet Mercangöz","doi":"10.1016/j.compind.2024.104102","DOIUrl":null,"url":null,"abstract":"<div><p>Detecting leaks in natural gas gathering pipelines is paramount for the safe and reliable operation of the gas and oil industry. Due to the lack of leak data and the changes in leak features, semi-supervised leak detection methods that use normal data for health model learning have attracted much attention. However, these approaches usually consider one-class normal samples as health data, which may fail to fit the reality of unlabeled multi-class non-leak data under variable operating conditions. In addition, existing semi-supervised methods often suffer from insufficient representation learning as they employ step-by-step training or rely on the low-level reconstruction of autoencoders. To address the above two key challenges, this paper proposes a novel end-to-end self-supervised leak detection method, self-supervised multi-sphere support vector data description. Specifically, it utilizes the presented multi-sphere support vector data description to model unlabeled multi-class non-leak data and the introduced self-supervised learning strategy to boost the representation learning of the end-to-end semi-supervised model. Moreover, the categories of unlabeled multi-class non-leak data are learned in an unsupervised way through alternating feature clustering and pseudo-label-based classification. A robust leak score calculation method is also designed to improve the performance of the proposed method. Finally, the experimental results on the field data collected from pipelines demonstrate the effectiveness of the proposed method.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":"159 ","pages":"Article 104102"},"PeriodicalIF":8.2000,"publicationDate":"2024-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361524000307","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting leaks in natural gas gathering pipelines is paramount for the safe and reliable operation of the gas and oil industry. Due to the lack of leak data and the changes in leak features, semi-supervised leak detection methods that use normal data for health model learning have attracted much attention. However, these approaches usually consider one-class normal samples as health data, which may fail to fit the reality of unlabeled multi-class non-leak data under variable operating conditions. In addition, existing semi-supervised methods often suffer from insufficient representation learning as they employ step-by-step training or rely on the low-level reconstruction of autoencoders. To address the above two key challenges, this paper proposes a novel end-to-end self-supervised leak detection method, self-supervised multi-sphere support vector data description. Specifically, it utilizes the presented multi-sphere support vector data description to model unlabeled multi-class non-leak data and the introduced self-supervised learning strategy to boost the representation learning of the end-to-end semi-supervised model. Moreover, the categories of unlabeled multi-class non-leak data are learned in an unsupervised way through alternating feature clustering and pseudo-label-based classification. A robust leak score calculation method is also designed to improve the performance of the proposed method. Finally, the experimental results on the field data collected from pipelines demonstrate the effectiveness of the proposed method.
期刊介绍:
The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that:
• Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry;
• Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry;
• Foster connections or integrations across diverse application areas of ICT in industry.