{"title":"基于机器学习的管道在线检测数据离群点检测","authors":"Muhammad Hussain, Tieling Zhang","doi":"10.1016/j.ress.2024.110553","DOIUrl":null,"url":null,"abstract":"<div><div>Pipeline companies are facing challenges in maintaining the integrity and reliability of their pipelines. They are working towards predictive maintenance using machine learning-based approaches to predicting anomalies. Training machine learning models requires sufficient data. Data quality is therefore becoming important because inaccurate data will lead to an inaccurate or wrong decision on pipeline condition assessment and the following management. This research paper intends to address the data quality issues of pipeline inspection data such as in-line inspection (ILI) data using machine learning models. Different machine learning models developed by random forest regression, linear regression, and nearest neighbors’ methods were tested to detect outliers in the ILI data. In this paper, the ILI data collected from an oil pipeline over a period of 22 years was applied to testing and analysis. To verify the outlier detection results of machine learning models, we used statistical analysis including Z-score method to check and find if there are any gaps in the analysis. It verifies that all these methods show almost the same or very similar results for the detection of the outliers. Hence, this study presents a robust method for the field applications in the pipeline industry.</div></div>","PeriodicalId":54500,"journal":{"name":"Reliability Engineering & System Safety","volume":null,"pages":null},"PeriodicalIF":9.4000,"publicationDate":"2024-10-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based outlier detection for pipeline in-line inspection data\",\"authors\":\"Muhammad Hussain, Tieling Zhang\",\"doi\":\"10.1016/j.ress.2024.110553\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Pipeline companies are facing challenges in maintaining the integrity and reliability of their pipelines. They are working towards predictive maintenance using machine learning-based approaches to predicting anomalies. Training machine learning models requires sufficient data. Data quality is therefore becoming important because inaccurate data will lead to an inaccurate or wrong decision on pipeline condition assessment and the following management. This research paper intends to address the data quality issues of pipeline inspection data such as in-line inspection (ILI) data using machine learning models. Different machine learning models developed by random forest regression, linear regression, and nearest neighbors’ methods were tested to detect outliers in the ILI data. In this paper, the ILI data collected from an oil pipeline over a period of 22 years was applied to testing and analysis. To verify the outlier detection results of machine learning models, we used statistical analysis including Z-score method to check and find if there are any gaps in the analysis. It verifies that all these methods show almost the same or very similar results for the detection of the outliers. Hence, this study presents a robust method for the field applications in the pipeline industry.</div></div>\",\"PeriodicalId\":54500,\"journal\":{\"name\":\"Reliability Engineering & System Safety\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":9.4000,\"publicationDate\":\"2024-10-12\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Reliability Engineering & System Safety\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0951832024006252\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, INDUSTRIAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Reliability Engineering & System Safety","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0951832024006252","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
摘要
管道公司在维护管道完整性和可靠性方面面临挑战。他们正在利用基于机器学习的方法来预测异常情况,从而实现预测性维护。训练机器学习模型需要足够的数据。因此,数据质量变得越来越重要,因为不准确的数据会导致管道状况评估和后续管理决策的不准确或错误。本研究论文旨在利用机器学习模型解决管道检测数据(如在线检测 (ILI) 数据)的数据质量问题。本文测试了由随机森林回归、线性回归和最近邻方法开发的不同机器学习模型,以检测 ILI 数据中的异常值。本文将从一条输油管道收集到的长达 22 年的 ILI 数据用于测试和分析。为了验证机器学习模型的离群值检测结果,我们使用了统计分析方法,包括 Zcore 方法,以检查和发现分析中是否存在任何漏洞。结果表明,所有这些方法对异常值的检测结果几乎相同或非常相似。因此,本研究提出了一种适用于管道行业现场应用的稳健方法。
Machine learning-based outlier detection for pipeline in-line inspection data
Pipeline companies are facing challenges in maintaining the integrity and reliability of their pipelines. They are working towards predictive maintenance using machine learning-based approaches to predicting anomalies. Training machine learning models requires sufficient data. Data quality is therefore becoming important because inaccurate data will lead to an inaccurate or wrong decision on pipeline condition assessment and the following management. This research paper intends to address the data quality issues of pipeline inspection data such as in-line inspection (ILI) data using machine learning models. Different machine learning models developed by random forest regression, linear regression, and nearest neighbors’ methods were tested to detect outliers in the ILI data. In this paper, the ILI data collected from an oil pipeline over a period of 22 years was applied to testing and analysis. To verify the outlier detection results of machine learning models, we used statistical analysis including Z-score method to check and find if there are any gaps in the analysis. It verifies that all these methods show almost the same or very similar results for the detection of the outliers. Hence, this study presents a robust method for the field applications in the pipeline industry.
期刊介绍:
Elsevier publishes Reliability Engineering & System Safety in association with the European Safety and Reliability Association and the Safety Engineering and Risk Analysis Division. The international journal is devoted to developing and applying methods to enhance the safety and reliability of complex technological systems, like nuclear power plants, chemical plants, hazardous waste facilities, space systems, offshore and maritime systems, transportation systems, constructed infrastructure, and manufacturing plants. The journal normally publishes only articles that involve the analysis of substantive problems related to the reliability of complex systems or present techniques and/or theoretical results that have a discernable relationship to the solution of such problems. An important aim is to balance academic material and practical applications.