{"title":"A multi-level classification model for corrosion defects in oil and gas pipelines using meta-learner ensemble (MLE) techniques","authors":"Adamu Abubakar Sani , Mohamed Mubarak Abdul Wahab , Nasir Shafiq , Kamaludden Usman Danyaro , Nasir Khan , Adamu Tafida , Arsalaan Khan Yousafzai","doi":"10.1016/j.jpse.2024.100244","DOIUrl":null,"url":null,"abstract":"<div><div>Maintaining the integrity of oil and gas pipelines is necessary for the efficient and safe transport of hydrocarbons. Corrosion defects can lead to decreased operational efficiency, leaks, a reduction in operational efficiency, and even catastrophic pipeline failures. Machine learning techniques are useful in detecting corrosion defects, ensemble methods that combine multiple classifiers offer better predictive accuracy. The aim of this work is to develop multi-level classification model an efficient ensemble technique capable of predicting the level of corrosion defects and addressing class imbalances in oil and gas pipeline data. The study uses a two-level stacking ensemble learning method that enhances corrosion defect prediction called the meta-learner ensemble (MLE). The model classifies corrosion defects into three categories: high, medium, and low. Prediction accuracy was improved by using a stacking classifier that combines multiple basic classifiers with a logistic regression meta-learner. Findings show that most corrosion defects fall within the low-risk category, with a significant number falling into the medium-to-high-risk range, highlighting the necessity for targeted maintenance. Considering the challenges of dataset imbalance, the stacking classifier achieved 94% accuracy, showing balanced performance across all risk categories. The stacking model outperformed the random forest and logistic regression models in terms of F1-scores, precision, and recall. This study demonstrates the application of stacking ensemble techniques in predicting corrosion risks and optimizing pipeline maintenance strategies. It provides vital information for improving pipeline safety and optimizing predictive maintenance practices by providing an in-depth assessment of various machine learning models, especially when real-time monitoring systems are integrated.</div></div>","PeriodicalId":100824,"journal":{"name":"Journal of Pipeline Science and Engineering","volume":"5 2","pages":"Article 100244"},"PeriodicalIF":4.8000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Pipeline Science and Engineering","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2667143324000714","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENERGY & FUELS","Score":null,"Total":0}
引用次数: 0
Abstract
Maintaining the integrity of oil and gas pipelines is necessary for the efficient and safe transport of hydrocarbons. Corrosion defects can lead to decreased operational efficiency, leaks, a reduction in operational efficiency, and even catastrophic pipeline failures. Machine learning techniques are useful in detecting corrosion defects, ensemble methods that combine multiple classifiers offer better predictive accuracy. The aim of this work is to develop multi-level classification model an efficient ensemble technique capable of predicting the level of corrosion defects and addressing class imbalances in oil and gas pipeline data. The study uses a two-level stacking ensemble learning method that enhances corrosion defect prediction called the meta-learner ensemble (MLE). The model classifies corrosion defects into three categories: high, medium, and low. Prediction accuracy was improved by using a stacking classifier that combines multiple basic classifiers with a logistic regression meta-learner. Findings show that most corrosion defects fall within the low-risk category, with a significant number falling into the medium-to-high-risk range, highlighting the necessity for targeted maintenance. Considering the challenges of dataset imbalance, the stacking classifier achieved 94% accuracy, showing balanced performance across all risk categories. The stacking model outperformed the random forest and logistic regression models in terms of F1-scores, precision, and recall. This study demonstrates the application of stacking ensemble techniques in predicting corrosion risks and optimizing pipeline maintenance strategies. It provides vital information for improving pipeline safety and optimizing predictive maintenance practices by providing an in-depth assessment of various machine learning models, especially when real-time monitoring systems are integrated.