{"title":"Big data oriented root cause identification approach based on PCA and SVM for product infant failure","authors":"Zhenzhen He, Yihai He, Yi Wei","doi":"10.1109/PHM.2016.7819776","DOIUrl":null,"url":null,"abstract":"Due to the increasing complexity and huge number of uncontrolled operational factors in manufacturing, the produced product usually comes with an exceptional high infant failure rate, and the root cause identification of product infant failure has been a very challenging issue for manufacturers. Especially in the era of big data, the large number of data could be collected from the product life cycle easily, these high-dimensional big data always bear many un-correlation noise information, which has caused serious problem that not only the accuracy may not be remarkable, but also the model-training time may be redundant to most of the current small data-driven method. Furthermore, traditional small data oriented analytic techniques are not applicable to the new big data environment. In order to solve this dilemma, this paper proposed a new method to identify the root cause of infant failure from the big data using the principal component analysis (PCA) and support vector machine (SVM). Firstly, data collected from design, manufacturing, and usage related to product infant failure mechanism has been divided into training data and test data. Secondly, PCA is applied to eliminate redundancy and reducing data dimension of original process feature parameters from raw data in low-dimensional space so that the key variables as the potential root cause candidates can be extracted. Thirdly, an SVM-based optimal hyper-plane to separate these candidates' features data is presented, and one-versus-all SVM classifier is designed to identify the final list of the root cause for infant failure by radial basis kernel function. Finally, the feasibility and validity of the proposed methods are demonstrated through a case study of computer board failure analysis.","PeriodicalId":202597,"journal":{"name":"2016 Prognostics and System Health Management Conference (PHM-Chengdu)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Prognostics and System Health Management Conference (PHM-Chengdu)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PHM.2016.7819776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Due to the increasing complexity and huge number of uncontrolled operational factors in manufacturing, the produced product usually comes with an exceptional high infant failure rate, and the root cause identification of product infant failure has been a very challenging issue for manufacturers. Especially in the era of big data, the large number of data could be collected from the product life cycle easily, these high-dimensional big data always bear many un-correlation noise information, which has caused serious problem that not only the accuracy may not be remarkable, but also the model-training time may be redundant to most of the current small data-driven method. Furthermore, traditional small data oriented analytic techniques are not applicable to the new big data environment. In order to solve this dilemma, this paper proposed a new method to identify the root cause of infant failure from the big data using the principal component analysis (PCA) and support vector machine (SVM). Firstly, data collected from design, manufacturing, and usage related to product infant failure mechanism has been divided into training data and test data. Secondly, PCA is applied to eliminate redundancy and reducing data dimension of original process feature parameters from raw data in low-dimensional space so that the key variables as the potential root cause candidates can be extracted. Thirdly, an SVM-based optimal hyper-plane to separate these candidates' features data is presented, and one-versus-all SVM classifier is designed to identify the final list of the root cause for infant failure by radial basis kernel function. Finally, the feasibility and validity of the proposed methods are demonstrated through a case study of computer board failure analysis.