{"title":"Enhanced process monitoring using machine learning-based control charts for poisson-distributed data","authors":"Faraz Mukhtiar , Babar Zaman , Naveed Razzaq Butt","doi":"10.1016/j.engappai.2025.111227","DOIUrl":null,"url":null,"abstract":"<div><div>The ability to detect shifts (e.g., outliers) in process monitoring is crucial for maintaining high-quality standards and operational efficiency in industrial environments. Control Charts (CCs) provide an organized framework for recognizing and managing anomalies, generally caused by assignable factors (e.g., identifiable issues) rather than inherent process variation. Traditional CCs, such as classical Shewhart, CUSUM, and EWMA, are commonly used to monitor Poisson observations in modern industries. The classical exponentially weighted moving average (EWMA) and cumulative sum (CUSUM) CCs are individually effective at detecting small-to-moderate shifts while Shewhart CCs identify moderate-to-large shifts in the process location and/or dispersion parameters. However, the classical CCs face limitations due to their sensitivity being constrained to specific ranges of shifts. To enhance the detection abilities of classical CCs in detecting all kinds of shifts in the process location parameter, this study proposes the integration of Machine Learning (ML) techniques into CCs to optimize the shift’s detection in process location parameter across a wider range. This study generates a dataset using the statistics of classical CCs based on Poisson-distributed data, which includes both in-control (stable process) and out-of-control (unstable process) processes. This dataset is used to train ML models, which are pre-processed through normalization and feature engineering through a heuristic approach before training. The performance of ML models is evaluated using standard regression metrics, specifically mean squared error and the coefficient of determination (<span><math><msup><mrow><mi>R</mi></mrow><mrow><mn>2</mn></mrow></msup></math></span>-score), to ensure effective generalization across varying process conditions. After training, these models are implemented within the proposed ML-based CC (<span><math><mrow><mi>M</mi><mi>L</mi><mi>C</mi><mi>C</mi></mrow></math></span>) schemes. Their process monitoring capabilities are then rigorously compared with traditional and existing CCs, utilizing key performance indicators such as average run length and standard deviation of run length. These metrics are computed through a Python-based algorithm developed using the Monte Carlo simulation method. For practical purposes, implementing the proposed <span><math><mrow><mi>M</mi><mi>L</mi><mi>C</mi><mi>C</mi></mrow></math></span> schemes with real-life data in the food processing industry, specifically the packaging of frozen orange juice concentrate. This practical example highlights the superiority of proposed <span><math><mrow><mi>M</mi><mi>L</mi><mi>C</mi><mi>C</mi></mrow></math></span> schemes in the early detection of shift(s) in process location parameter(s) against classical CCs in real-life scenarios.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"157 ","pages":"Article 111227"},"PeriodicalIF":7.5000,"publicationDate":"2025-06-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S095219762501228X","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
The ability to detect shifts (e.g., outliers) in process monitoring is crucial for maintaining high-quality standards and operational efficiency in industrial environments. Control Charts (CCs) provide an organized framework for recognizing and managing anomalies, generally caused by assignable factors (e.g., identifiable issues) rather than inherent process variation. Traditional CCs, such as classical Shewhart, CUSUM, and EWMA, are commonly used to monitor Poisson observations in modern industries. The classical exponentially weighted moving average (EWMA) and cumulative sum (CUSUM) CCs are individually effective at detecting small-to-moderate shifts while Shewhart CCs identify moderate-to-large shifts in the process location and/or dispersion parameters. However, the classical CCs face limitations due to their sensitivity being constrained to specific ranges of shifts. To enhance the detection abilities of classical CCs in detecting all kinds of shifts in the process location parameter, this study proposes the integration of Machine Learning (ML) techniques into CCs to optimize the shift’s detection in process location parameter across a wider range. This study generates a dataset using the statistics of classical CCs based on Poisson-distributed data, which includes both in-control (stable process) and out-of-control (unstable process) processes. This dataset is used to train ML models, which are pre-processed through normalization and feature engineering through a heuristic approach before training. The performance of ML models is evaluated using standard regression metrics, specifically mean squared error and the coefficient of determination (-score), to ensure effective generalization across varying process conditions. After training, these models are implemented within the proposed ML-based CC () schemes. Their process monitoring capabilities are then rigorously compared with traditional and existing CCs, utilizing key performance indicators such as average run length and standard deviation of run length. These metrics are computed through a Python-based algorithm developed using the Monte Carlo simulation method. For practical purposes, implementing the proposed schemes with real-life data in the food processing industry, specifically the packaging of frozen orange juice concentrate. This practical example highlights the superiority of proposed schemes in the early detection of shift(s) in process location parameter(s) against classical CCs in real-life scenarios.
期刊介绍:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.