{"title":"Multi-objective weighted average algorithm: a novel algorithm for multi-objective optimization problems and its application in engineering problems","authors":"Jun Cheng, Wim De Waele","doi":"10.1016/j.engappai.2025.111569","DOIUrl":"10.1016/j.engappai.2025.111569","url":null,"abstract":"<div><div>Numerous meta-heuristic algorithms struggle with degraded performance when addressing multi-objective optimization problems due to the challenge of balancing two goals: accurately estimating Pareto-optimal solutions and ensuring their broad distribution across objectives. While the Weighted Average Algorithm (WAA) excels in single-objective optimization, its scalarization-based mechanism fundamentally conflicts with multi-objective requirements. To bridge this gap, we propose the Multi-Objective Weighted Average Algorithm (MOWAA) with three key innovations: (1) a hybrid exploration-exploitation mechanism integrating adaptive mutation and crossover operations; (2) an elitist archive management system using efficient non-dominated sorting across three critical solution sets; and (3) a novel roulette-wheel-based leader selection strategy that dynamically balances convergence and diversity. To verify the performance of the developed MOWAA, the numerical benchmark test functions (CEC2009, ZDT and DTLZ) and four engineering problems (the Binh and Korn (BNH), Constraint (CONSTR), Srinivas and Deb (SRN), and 10-bar Truss (BAR TRUSS)) are used in comparison with three multi-objective optimization algorithms. The results show that MOWAA achieves better optimization performance than comparative algorithms, with Pareto-optimal solutions exhibiting excellent convergence and coverage. Finally, applying MOWAA to an Artificial Neural Network (ANN) model using an experimental dataset on surface waviness (in mm) of Wire Arc Additive Manufacturing (WAAM) components enhances predictive accuracy by balancing optimization of prediction error and variance. Compared to single-objective optimization methods, the MOWAA approach effectively captures the complex relationships between process parameters and waviness in the WAAM process.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111569"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144522960","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Zhenshen Qu , Xinxu Cai , Tianyi Zhang , Jiazheng Xu , Xintong Jiang , Chuan Lin
{"title":"Real-time inner wall surface defect detection based on multi-morphological feature fusion network","authors":"Zhenshen Qu , Xinxu Cai , Tianyi Zhang , Jiazheng Xu , Xintong Jiang , Chuan Lin","doi":"10.1016/j.engappai.2025.111331","DOIUrl":"10.1016/j.engappai.2025.111331","url":null,"abstract":"<div><div>In industrial manufacturing, defects on the inner wall surface are crucial for quality and safety assessment. However, existing detection methods are limited by low resolution and glare interference. This study presents a Multi-morphological Feature Fusion Network for Object Detection (MFFN-OD) for 360°detection of inner wall image defects. First, it cleverly integrates panoramic imaging with conventional features through a dual-branch backbone and annular features, ensuring rotation invariance and holistic feature preservation. Second, we develop an Adaptive Multi-morphological Feature Alignment Module (AMFAM) that combats centrally polarized defects by automatically adjusting feature alignment, reducing noise, and increasing accuracy, as well as a feature interaction module with a focus on strengthening multiscale feature fusion. Third, we introduce an Asymptotic Feature Pyramid Network with Auxiliary Features (AFPN-AF) to further refine fusion, close semantic gaps, and improve performance. Experimental results show that MFFN-OD achieves 96.1% mean Average Precision (mAP) and 94.3% Average Precision (AP) for demanding faults with fast detection of 17 milliseconds per frame, meeting industrial requirements for accuracy and real-time performance.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111331"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144522961","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Xiaoyu Liu , Shengze Cai , Hongtao Lin , Xingli Liu , Xiuhua Hu , Longjiang Zhang , Qi Gao
{"title":"CorNet: A deep learning method based on physics-guided and attention mechanism for predicting flow field of coronary arterial tree","authors":"Xiaoyu Liu , Shengze Cai , Hongtao Lin , Xingli Liu , Xiuhua Hu , Longjiang Zhang , Qi Gao","doi":"10.1016/j.engappai.2025.111460","DOIUrl":"10.1016/j.engappai.2025.111460","url":null,"abstract":"<div><div>Replacing traditional computational fluid dynamics (CFD) with deep learning techniques has become a prevalent approach for studying blood flow and diagnosing diseases. Nevertheless, no neural network has been specifically designed for tree-like blood vessel structures that can effectively capture their bifurcation patterns and branching dependencies. Coronary neural network (CorNet) was proposed for predicting the pressure field and fractional flow reserve (FFR<span><math><msub><mrow></mrow><mrow><mi>CorNet</mi></mrow></msub></math></span>) of the coronary artery tree. The novelty of this framework lies in utilizing self-attention mechanisms to address long-term spatial dependencies within the vascular tree. Our dataset comprised 295 coronary arterial trees from 273 patients, each including point clouds reconstructed from medical images and fractional flow reserve values calculated by CFD (FFR<span><math><msub><mrow></mrow><mrow><mi>CT</mi></mrow></msub></math></span>). Physical constraints were incorporated to mitigate data sparsity and enhance the interpretability of the neural network. The pressure results predicted by CorNet are consistent with the pressure calculated by CFD (mean relative error = 3.96%). There is also a good consistency between FFR<span><math><msub><mrow></mrow><mrow><mi>CT</mi></mrow></msub></math></span> and FFR<span><math><msub><mrow></mrow><mrow><mi>CorNet</mi></mrow></msub></math></span>.Compared to invasive fractional flow reserve, which is considered the “gold standard”, FFR<span><math><msub><mrow></mrow><mrow><mi>CorNet</mi></mrow></msub></math></span> demonstrates accuracy comparable to FFR<span><math><msub><mrow></mrow><mrow><mi>CT</mi></mrow></msub></math></span> (88% vs. 90%) while reducing computation time by several thousand-fold. Compared to previous studies, CorNet eliminates the need to identify specific lesion sites or manually extract geometric parameters of stenotic segments. This is the first computational method to predict hemodynamics in three-dimensional vascular tree structures using attention mechanisms within a deep learning model. We foresee that this framework will enable near-real-time flow field predictions for arterial trees and offer valuable insights for cardiovascular disease treatment.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111460"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523254","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"A review on data-driven prognostics and health management for wind turbine systems","authors":"Mi Yan , Siu Cheung Hui , Na Jiang , Ning Li","doi":"10.1016/j.engappai.2025.111484","DOIUrl":"10.1016/j.engappai.2025.111484","url":null,"abstract":"<div><div>The wind power industry has developed rapidly due to the transformation of the global energy matrix. Prognostics and health management have attracted significant attention from industries and academia to ensure the reliability and safety of wind turbines, reduce maintenance costs, and increase productivity. Currently, most wind farms use supervisory control and data acquisition systems to collect, record, and store wind turbine operating data. Data-driven prognostics and health management of wind turbine systems have become the most commonly used methods for real-time monitoring and fault alarm prediction. Although some reviews of studies on data-driven prognostics and health management for wind turbine systems are available, they are structured mainly based on the processing tasks or key components of wind turbine systems, and have overlooked the challenges of data quality behind these processes. Different from previous works, this paper reviews the current developments of data-driven prognostics and health management for wind turbine systems from the perspective of data challenges, which include data availability, data labeling, data scarcity, data imbalance, data inconsistency, dynamic data and fleet-based data. In this paper, we discuss some open datasets and current data challenges in data-driven prognostics and health management for wind turbine systems, and review the related methods proposed for addressing these data issues. We then provide practical applications to help engineers understand data-driven prognostics and health management better. Finally, future research directions are suggested for further work on data-driven prognostics and health management for wind turbine systems.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111484"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523214","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Onder Civelek , Sedat Gormus , H.İbrahim Okumus , Hasan Yilmaz
{"title":"Addressing anomalies in smart grids: Methods for detecting and localizing high-current loads","authors":"Onder Civelek , Sedat Gormus , H.İbrahim Okumus , Hasan Yilmaz","doi":"10.1016/j.engappai.2025.111223","DOIUrl":"10.1016/j.engappai.2025.111223","url":null,"abstract":"<div><div>The concept of a smart grid encompasses a wide range of advanced technologies that surpass the capabilities of traditional power grids, enabling enhanced monitoring, control and efficiency. In this context, the term “anomaly” denotes unusual or unexpected occurrences, such as abnormal consumption patterns, infrastructure failures, power outages, cyber attacks, or energy theft. Anomaly detection is a critical aspect of improving the reliability and operational efficiency of Smart grid systems. In this study, we focus on detecting high current loads as anomalies in low voltage networks and present a system designed for this purpose. The system architecture includes distribution transformers and customer meters equipped with current sensors, radio frequency (RF) modems for wireless data transmission, and a central management server for data analysis and storage. The system employs machine learning (ML) algorithms for real-time anomaly detection and localization. Therefore, in the feature extraction phase, three distinct methods are utilized: Pattern Analysis, Discrete Wavelet Transform (DWT), and Fast Walsh–Hadamard Transform (FHWT). The Pattern Analysis method achieved the best performance using the Matern Gaussian Process Regression (MGPR) approach, whereas the DWT and FHWT methods yielded the most accurate results with the Optimizable Boosting Method. The results of the Pattern Analysis method indicate a test root mean square error (RMSE) of 69.36, a coefficient of determination (R<sup>2</sup>) of 0.97, and a mean absolute error (MAE) of 45.40. The DWT method, employing a five-level wavelet transform with Daubechies 18 as the main wavelet, achieved a test RMSE of 53.44, an R<sup>2</sup> of 0.98, and an MAE of 37.25. The FHWT method resulted in a test RMSE of 83.99, an R<sup>2</sup> of 0.95, and an MAE of 53.71. Among these methods, the DWT method demonstrated the highest accuracy, with an average error rate of 3.02%, while the Pattern Analysis method and the FHWT method exhibited error rates of 4.09% and 4.86%, respectively. This innovative anomaly detection and localization approach provides a robust foundation for future research and development in power distribution networks, with the potential to reduce energy losses and improve grid stability. This system holds significant promise for reducing energy losses, preventing equipment damage, and enhancing grid stability by enabling rapid identification of unauthorized consumption and fault conditions. Its real-time monitoring capability empowers utilities to mitigate risks associated with energy theft and transient overloads, directly contributing to operational cost savings and improved service reliability.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111223"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523255","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Stability analysis of a homogeneous rock slope under steady-state seepage using artificial neural networks","authors":"M.A. Millán , R. Galindo","doi":"10.1016/j.engappai.2025.111556","DOIUrl":"10.1016/j.engappai.2025.111556","url":null,"abstract":"<div><div>An accurate and effective procedure to determine the stability of rock slopes is paramount during the design and, even more so, during the management and maintenance of transportation infrastructures, when the rock slopes should be continuously monitored and checked against changing environmental conditions. The stability of rock slopes may be assessed using stability charts or limit equilibrium methods that assume an equivalent linear Mohr-Coulomb failure criterion for the rock or numerical models that can deal with complex configurations and rock behavior but demand more computational resources, expertise, and advanced software. In this study, an artificial neural network (ANN) was used to predict the safety factor of a two-dimensional, homogeneous rock slope incorporating several critical factors simultaneously, such as the rock's non-linear failure behavior, rock dilatancy, and different levels of the groundwater table associated with steady-state seepage flow through the rock. This greatly extends previous contributions in the field. A two-hidden layers ANN was trained using thousands of numerical simulations applying the Discontinuity Layout Optimization model, considering the rock mass toughness coefficient, non-dimensional height of the slope, artificial slope angle, dilatancy, and non-dimensional water level position. The ANN predictions closely matched the numerical results, demonstrating its potential as an easier and more accessible method for accurately evaluating the safety factor of rock slopes and as a reliable alternative to traditional numerical methods within specified input ranges. Its implementation is straightforward, using simple equations provided in the article, and it is easily scalable to perform intensive tasks and help complex engineering decision-making.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111556"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523215","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"ESRPCB: An edge guided super-Resolution model and ensemble learning for tiny Printed Circuit Board defect detection","authors":"Xiem HoangVan , Dang Bui Dinh , Thanh Nguyen Canh , Van-Truong Nguyen","doi":"10.1016/j.engappai.2025.111547","DOIUrl":"10.1016/j.engappai.2025.111547","url":null,"abstract":"<div><div>Printed Circuit Boards (PCBs) are critical components in modern electronics, which require stringent quality control to ensure proper functionality. However, the detection of defects in small-scale PCBs images poses significant challenges as a result of the low resolution of the captured images, leading to potential confusion between defects and noise. To overcome these challenges, this paper proposes a novel framework, named ESRPCB (edge-guided super-resolution for PCBs defect detection), which combines edge-guided super-resolution with ensemble learning to enhance PCBs defect detection. The framework leverages the edge information to guide the EDSR (Enhanced Deep Super-Resolution) model with a novel ResCat (Residual Concatenation) structure, enabling it to reconstruct high-resolution images from small PCBs inputs. By incorporating edge features, the super-resolution process preserves critical structural details, ensuring that tiny defects remain distinguishable in the enhanced image. Following this, a multi-modal defect detection model employs ensemble learning to analyze the super-resolved image, improving the accuracy of defect identification. Experimental results demonstrate that ESRPCB achieves superior performance compared to State-of-the-Art (SOTA) methods, achieving an average Peak Signal to Noise Ratio (PSNR) of 30.54 <span><math><mrow><mi>d</mi><mi>B</mi><mrow><mo>(</mo><mi>d</mi><mi>e</mi><mi>c</mi><mi>i</mi><mi>b</mi><mi>e</mi><mi>l</mi><mo>)</mo></mrow></mrow></math></span>, surpassing EDSR by <span><math><mrow><mn>0</mn><mo>.</mo><mn>42</mn><mi>d</mi><mi>B</mi></mrow></math></span>. In defect detection, ESRPCB achieves a mAP50(mean average precision at an Intersection over Union threshold of 0.50) of 0.965, surpassing EDSR (0.905) and traditional super-resolution models by over 5%. Furthermore, the ensemble-based detection approach further enhances performance, achieving a mAP50 of 0.977. These results highlight the effectiveness of ESRPCB in enhancing both image quality and defect detection accuracy, particularly in challenging low-resolution scenarios.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111547"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523122","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Aiman Tariq, Büşra Uzun, Babür Deliktaş, Mustafa Özgür Yaylı
{"title":"Bending analysis of cantilever microbeams with three porosity distributions using physics-informed neural network and modified couple stress theory","authors":"Aiman Tariq, Büşra Uzun, Babür Deliktaş, Mustafa Özgür Yaylı","doi":"10.1016/j.engappai.2025.111589","DOIUrl":"10.1016/j.engappai.2025.111589","url":null,"abstract":"<div><div>This study explores the use of a Physics-Informed Neural Network (PINN) framework to investigate the bending behavior of a cantilever microbeam made of porous material. PINN is a powerful approach that combines machine learning with physics principles to address the challenges of limited training data and enforce domain knowledge into the learning process, making them effective surrogate solvers for Partial Differential Equations (PDEs). In this work, a cantilever microbeam subjected to a uniformly distributed transverse load is examined, considering three different pore distributions including homogeneous, symmetric, and non-symmetric. The bending analysis incorporates the size effect by integrating the modified couple stress theory with the Euler-Bernoulli beam theory. First, the bending equation based on the modified couple stress theory is extended to include porous material properties. The governing equation is then solved using the Laplace transform. The PINN model is trained to approximate the solution by minimizing a loss function that accounts for residual errors at collocation points, as well as initial and boundary conditions. To enhance computational efficiency, the optimal hyperparameters of the PINN model are determined using a combination of Taguchi design of experiments and the Grey Relational Method. Taguchi-Grey approach effectively captures the trade-off between these objectives by normalizing and aggregating them into a single value to reflect the overall performance. The results are validated against analytical solutions based on Laplace transform, and the influence of key parameters such as microbeam length, length scale parameter, and porosity is systematically investigated.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111589"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523167","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
{"title":"Investigation of the dependence of fill factor and efficiency on gallium-doped silicon wafer characteristics using unsupervised learning","authors":"Denish Hirpara , Paramsinh Zala , Meenakshi Bhaisare , Chandra Mauli Kumar , Mayank Gupta , Manoj Kumar , Brijesh Tripathi","doi":"10.1016/j.engappai.2025.111598","DOIUrl":"10.1016/j.engappai.2025.111598","url":null,"abstract":"<div><div>This research analyses the performance parameters of bifacial silicon solar cells using the production line data of gallium-doped silicon wafers through unsupervised machine learning models and reports suitable input parameters. Gallium doping offers enhanced electronic properties, mitigating the light-induced degradation commonly seen in boron-doped silicon, thus making it a promising material for high-efficiency solar cells. The impact of silicon wafer thickness, resistivity, and carrier lifetime on fill factor, open-circuit voltage, and efficiency has been investigated. Employing unsupervised learning models, extensive production line data has been analysed to elucidate the complex dependencies between input parameters of silicon wafers (thickness, resistivity, and carrier lifetime) and performance parameters of fabricated solar cells using these silicon wafers (fill factor, open-circuit voltage, and efficiency). The findings indicate significant dependency of performance parameters on input parameters. Random forest classifier model demonstrated robust predictive capabilities, providing valuable insights for optimizing manufacturing processes. Based on analysis with unsupervised learning of data, following conclusions are drawn for input parameters of silicon wafers: (i) thickness range of 160–164 micro-meter gives the highest fill factor, (ii) resistivity range of 0.5–0.8 Ohm-centimetre is superior for higher fill factor, (iii) carrier lifetime of 0.9–1.05 micro-second results in better fill factor. A combination of all the above three parameters works in favour of high fill factor, it may not be a good idea to achieve one and deviate from the others. The results contribute to guiding future material and process optimizations, pushing the boundaries of solar cell efficiency.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111598"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523211","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
Lucas Gouveia Omena Lopes , Thales Miranda de Almeida Vieira , Pedro Esteves Aranha , Eduardo Toledo de Lima Junior , William Wagner Matos Lira
{"title":"Detection and classification of anomalies in oil well production using Open-World Learning","authors":"Lucas Gouveia Omena Lopes , Thales Miranda de Almeida Vieira , Pedro Esteves Aranha , Eduardo Toledo de Lima Junior , William Wagner Matos Lira","doi":"10.1016/j.engappai.2025.111514","DOIUrl":"10.1016/j.engappai.2025.111514","url":null,"abstract":"<div><div>Accurate anomaly detection and classification are critical for operational safety and efficiency in oil well production. While machine learning methods can identify known anomalies, detecting and classifying previously unseen anomalies remains a challenge. This paper presents the first Open-World Learning strategy applied to anomalies in oil well production data. The strategy detects anomalous behavior, classifies whether it belongs to a known labeled anomaly, and, if not, clusters it into newly proposed anomaly classes and learns to classify them. The approach integrates autoencoder reconstruction error for anomaly detection, autoencoder-based dimensionality reduction to extract latent features, binary classifiers to classify known anomalies, and clustering methods to group similar unseen anomalies. If an anomaly is detected via reconstruction error, the binary classifiers determine whether it belongs to a known class. If it does not, the clustering method groups similar events into new classes, which are validated by human experts. This validation enables the training of specific binary classifiers for the new classes and updates existing ones. Experiments on real anomalous oil well production data demonstrate that the discovered clusters align well with ground-truth labels. The clustering methodology achieves 81% accuracy overall, exceeding 95% for certain anomalies, while updated binary classifiers reach up to 99% accuracy. These findings demonstrate the proposed method’s effectiveness in dynamically adapting to novel anomalies, improving classification accuracy, and enhancing oil well monitoring.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"159 ","pages":"Article 111514"},"PeriodicalIF":7.5,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"144523216","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}