Cang Wu , Dong Wang , Min Luo , Wenpo Huang , Zexin Si
{"title":"基于随机森林学习的EWMA控制图对高维过程的非参数监测","authors":"Cang Wu , Dong Wang , Min Luo , Wenpo Huang , Zexin Si","doi":"10.1016/j.cie.2025.111111","DOIUrl":null,"url":null,"abstract":"<div><div>Multi-sensor systems have been widely used in manufacturing processes to perceive environmental or operating conditions and quality information, collecting large amounts of high-dimensional quality data. Because these systems are so common in practice, the important question has emerged as to how to leverage the full potential of rich data to monitor the various processes. Most traditional multivariate control charts in the field of high-dimensional data monitoring are no longer relevant because of their high dimensionality and typically unknown prior distribution of variables. In response to this need, this paper proposes a novel nonparametric Exponentially Weighted Moving Average (EWMA) control scheme by employing the random forest (RF) algorithm and log-likelihood method to transform high-dimensional data into one-dimensional data serving as the input of monitoring statistics. The simulation results indicate that the proposed control scheme outperforms its competitors in terms of various distributions and data types, especially for only one out-of-control (OC) cluster in the data stream. We also extend our scheme to the additional cases of a disk monitoring study and a breast cancer monitoring study prove the robustness and effectiveness of the proposed scheme.</div></div>","PeriodicalId":55220,"journal":{"name":"Computers & Industrial Engineering","volume":"204 ","pages":"Article 111111"},"PeriodicalIF":6.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Nonparametric monitoring of high-dimensional processes via EWMA control charts based on random forest learning\",\"authors\":\"Cang Wu , Dong Wang , Min Luo , Wenpo Huang , Zexin Si\",\"doi\":\"10.1016/j.cie.2025.111111\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Multi-sensor systems have been widely used in manufacturing processes to perceive environmental or operating conditions and quality information, collecting large amounts of high-dimensional quality data. Because these systems are so common in practice, the important question has emerged as to how to leverage the full potential of rich data to monitor the various processes. Most traditional multivariate control charts in the field of high-dimensional data monitoring are no longer relevant because of their high dimensionality and typically unknown prior distribution of variables. In response to this need, this paper proposes a novel nonparametric Exponentially Weighted Moving Average (EWMA) control scheme by employing the random forest (RF) algorithm and log-likelihood method to transform high-dimensional data into one-dimensional data serving as the input of monitoring statistics. The simulation results indicate that the proposed control scheme outperforms its competitors in terms of various distributions and data types, especially for only one out-of-control (OC) cluster in the data stream. We also extend our scheme to the additional cases of a disk monitoring study and a breast cancer monitoring study prove the robustness and effectiveness of the proposed scheme.</div></div>\",\"PeriodicalId\":55220,\"journal\":{\"name\":\"Computers & Industrial Engineering\",\"volume\":\"204 \",\"pages\":\"Article 111111\"},\"PeriodicalIF\":6.7000,\"publicationDate\":\"2025-04-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computers & Industrial Engineering\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0360835225002578\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers & Industrial Engineering","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0360835225002578","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
Nonparametric monitoring of high-dimensional processes via EWMA control charts based on random forest learning
Multi-sensor systems have been widely used in manufacturing processes to perceive environmental or operating conditions and quality information, collecting large amounts of high-dimensional quality data. Because these systems are so common in practice, the important question has emerged as to how to leverage the full potential of rich data to monitor the various processes. Most traditional multivariate control charts in the field of high-dimensional data monitoring are no longer relevant because of their high dimensionality and typically unknown prior distribution of variables. In response to this need, this paper proposes a novel nonparametric Exponentially Weighted Moving Average (EWMA) control scheme by employing the random forest (RF) algorithm and log-likelihood method to transform high-dimensional data into one-dimensional data serving as the input of monitoring statistics. The simulation results indicate that the proposed control scheme outperforms its competitors in terms of various distributions and data types, especially for only one out-of-control (OC) cluster in the data stream. We also extend our scheme to the additional cases of a disk monitoring study and a breast cancer monitoring study prove the robustness and effectiveness of the proposed scheme.
期刊介绍:
Computers & Industrial Engineering (CAIE) is dedicated to researchers, educators, and practitioners in industrial engineering and related fields. Pioneering the integration of computers in research, education, and practice, industrial engineering has evolved to make computers and electronic communication integral to its domain. CAIE publishes original contributions focusing on the development of novel computerized methodologies to address industrial engineering problems. It also highlights the applications of these methodologies to issues within the broader industrial engineering and associated communities. The journal actively encourages submissions that push the boundaries of fundamental theories and concepts in industrial engineering techniques.