A multi-level classification model for corrosion defects in oil and gas pipelines using meta-learner ensemble (MLE) techniques

IF 4.8 Q2 ENERGY & FUELS

Journal of Pipeline Science and Engineering Pub Date : 2025-06-01 DOI:10.1016/j.jpse.2024.100244

Adamu Abubakar Sani , Mohamed Mubarak Abdul Wahab , Nasir Shafiq , Kamaludden Usman Danyaro , Nasir Khan , Adamu Tafida , Arsalaan Khan Yousafzai

{"title":"A multi-level classification model for corrosion defects in oil and gas pipelines using meta-learner ensemble (MLE) techniques","authors":"Adamu Abubakar Sani , Mohamed Mubarak Abdul Wahab , Nasir Shafiq , Kamaludden Usman Danyaro , Nasir Khan , Adamu Tafida , Arsalaan Khan Yousafzai","doi":"10.1016/j.jpse.2024.100244","DOIUrl":null,"url":null,"abstract":"<div><div>Maintaining the integrity of oil and gas pipelines is necessary for the efficient and safe transport of hydrocarbons. Corrosion defects can lead to decreased operational efficiency, leaks, a reduction in operational efficiency, and even catastrophic pipeline failures. Machine learning techniques are useful in detecting corrosion defects, ensemble methods that combine multiple classifiers offer better predictive accuracy. The aim of this work is to develop multi-level classification model an efficient ensemble technique capable of predicting the level of corrosion defects and addressing class imbalances in oil and gas pipeline data. The study uses a two-level stacking ensemble learning method that enhances corrosion defect prediction called the meta-learner ensemble (MLE). The model classifies corrosion defects into three categories: high, medium, and low. Prediction accuracy was improved by using a stacking classifier that combines multiple basic classifiers with a logistic regression meta-learner. Findings show that most corrosion defects fall within the low-risk category, with a significant number falling into the medium-to-high-risk range, highlighting the necessity for targeted maintenance. Considering the challenges of dataset imbalance, the stacking classifier achieved 94% accuracy, showing balanced performance across all risk categories. The stacking model outperformed the random forest and logistic regression models in terms of F1-scores, precision, and recall. This study demonstrates the application of stacking ensemble techniques in predicting corrosion risks and optimizing pipeline maintenance strategies. It provides vital information for improving pipeline safety and optimizing predictive maintenance practices by providing an in-depth assessment of various machine learning models, especially when real-time monitoring systems are integrated.</div></div>","PeriodicalId":100824,"journal":{"name":"Journal of Pipeline Science and Engineering","volume":"5 2","pages":"Article 100244"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pipeline Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667143324000714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}

引用次数: 0

Abstract

Maintaining the integrity of oil and gas pipelines is necessary for the efficient and safe transport of hydrocarbons. Corrosion defects can lead to decreased operational efficiency, leaks, a reduction in operational efficiency, and even catastrophic pipeline failures. Machine learning techniques are useful in detecting corrosion defects, ensemble methods that combine multiple classifiers offer better predictive accuracy. The aim of this work is to develop multi-level classification model an efficient ensemble technique capable of predicting the level of corrosion defects and addressing class imbalances in oil and gas pipeline data. The study uses a two-level stacking ensemble learning method that enhances corrosion defect prediction called the meta-learner ensemble (MLE). The model classifies corrosion defects into three categories: high, medium, and low. Prediction accuracy was improved by using a stacking classifier that combines multiple basic classifiers with a logistic regression meta-learner. Findings show that most corrosion defects fall within the low-risk category, with a significant number falling into the medium-to-high-risk range, highlighting the necessity for targeted maintenance. Considering the challenges of dataset imbalance, the stacking classifier achieved 94% accuracy, showing balanced performance across all risk categories. The stacking model outperformed the random forest and logistic regression models in terms of F1-scores, precision, and recall. This study demonstrates the application of stacking ensemble techniques in predicting corrosion risks and optimizing pipeline maintenance strategies. It provides vital information for improving pipeline safety and optimizing predictive maintenance practices by providing an in-depth assessment of various machine learning models, especially when real-time monitoring systems are integrated.

查看原文本刊更多论文

基于元学习器集成（MLE）技术的油气管道腐蚀缺陷多级分类模型

保持油气管道的完整性对于碳氢化合物的高效和安全运输是必要的。腐蚀缺陷会导致作业效率下降、泄漏、作业效率降低，甚至导致灾难性的管道故障。机器学习技术在检测腐蚀缺陷方面很有用，组合多个分类器的集成方法提供了更好的预测准确性。这项工作的目的是开发多级分类模型，一种有效的集成技术，能够预测腐蚀缺陷的水平，并解决石油和天然气管道数据中的类别不平衡问题。该研究使用了一种叫做元学习集成（MLE）的两级堆叠集成学习方法来增强腐蚀缺陷预测。该模型将腐蚀缺陷分为高、中、低三类。通过将多个基本分类器与逻辑回归元学习器相结合的堆叠分类器提高预测精度。研究结果表明，大多数腐蚀缺陷属于低风险类别，有相当数量的腐蚀缺陷属于中高风险范围，这突出了有针对性维护的必要性。考虑到数据不平衡的挑战，叠加分类器达到了94%的准确率，在所有风险类别中表现出平衡的性能。在f1得分、精度和召回率方面，叠加模型优于随机森林和逻辑回归模型。本研究展示了堆叠集成技术在预测管道腐蚀风险和优化管道维护策略中的应用。通过对各种机器学习模型进行深入评估，特别是在集成实时监控系统时，它为提高管道安全性和优化预测性维护实践提供了重要信息。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Pipeline Science and Engineering

CiteScore

7.50

自引率

0.00%

发文量