Big data oriented root cause identification approach based on PCA and SVM for product infant failure

Zhenzhen He, Yihai He, Yi Wei
{"title":"Big data oriented root cause identification approach based on PCA and SVM for product infant failure","authors":"Zhenzhen He, Yihai He, Yi Wei","doi":"10.1109/PHM.2016.7819776","DOIUrl":null,"url":null,"abstract":"Due to the increasing complexity and huge number of uncontrolled operational factors in manufacturing, the produced product usually comes with an exceptional high infant failure rate, and the root cause identification of product infant failure has been a very challenging issue for manufacturers. Especially in the era of big data, the large number of data could be collected from the product life cycle easily, these high-dimensional big data always bear many un-correlation noise information, which has caused serious problem that not only the accuracy may not be remarkable, but also the model-training time may be redundant to most of the current small data-driven method. Furthermore, traditional small data oriented analytic techniques are not applicable to the new big data environment. In order to solve this dilemma, this paper proposed a new method to identify the root cause of infant failure from the big data using the principal component analysis (PCA) and support vector machine (SVM). Firstly, data collected from design, manufacturing, and usage related to product infant failure mechanism has been divided into training data and test data. Secondly, PCA is applied to eliminate redundancy and reducing data dimension of original process feature parameters from raw data in low-dimensional space so that the key variables as the potential root cause candidates can be extracted. Thirdly, an SVM-based optimal hyper-plane to separate these candidates' features data is presented, and one-versus-all SVM classifier is designed to identify the final list of the root cause for infant failure by radial basis kernel function. Finally, the feasibility and validity of the proposed methods are demonstrated through a case study of computer board failure analysis.","PeriodicalId":202597,"journal":{"name":"2016 Prognostics and System Health Management Conference (PHM-Chengdu)","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Prognostics and System Health Management Conference (PHM-Chengdu)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/PHM.2016.7819776","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Due to the increasing complexity and huge number of uncontrolled operational factors in manufacturing, the produced product usually comes with an exceptional high infant failure rate, and the root cause identification of product infant failure has been a very challenging issue for manufacturers. Especially in the era of big data, the large number of data could be collected from the product life cycle easily, these high-dimensional big data always bear many un-correlation noise information, which has caused serious problem that not only the accuracy may not be remarkable, but also the model-training time may be redundant to most of the current small data-driven method. Furthermore, traditional small data oriented analytic techniques are not applicable to the new big data environment. In order to solve this dilemma, this paper proposed a new method to identify the root cause of infant failure from the big data using the principal component analysis (PCA) and support vector machine (SVM). Firstly, data collected from design, manufacturing, and usage related to product infant failure mechanism has been divided into training data and test data. Secondly, PCA is applied to eliminate redundancy and reducing data dimension of original process feature parameters from raw data in low-dimensional space so that the key variables as the potential root cause candidates can be extracted. Thirdly, an SVM-based optimal hyper-plane to separate these candidates' features data is presented, and one-versus-all SVM classifier is designed to identify the final list of the root cause for infant failure by radial basis kernel function. Finally, the feasibility and validity of the proposed methods are demonstrated through a case study of computer board failure analysis.
基于主成分分析和支持向量机的产品故障大数据根本原因识别方法
由于生产过程中不受控制的操作因素越来越多,生产出来的产品通常具有异常高的婴儿故障率,产品婴儿失效的根本原因识别一直是制造商非常具有挑战性的问题。特别是在大数据时代,可以很容易地从产品生命周期中收集到大量数据,这些高维大数据总是带有许多不相关的噪声信息,这对于目前大多数小数据驱动方法来说,不仅精度可能不显著,而且模型训练时间可能冗余,造成了严重的问题。此外,传统的面向小数据的分析技术并不适用于新的大数据环境。为了解决这一难题,本文提出了一种利用主成分分析(PCA)和支持向量机(SVM)从大数据中识别婴儿故障根本原因的新方法。首先,从设计、制造和使用中收集到的与产品婴儿失效机制相关的数据分为训练数据和测试数据。其次,利用主成分分析法在低维空间中对原始过程特征参数进行去冗余和降维处理,提取出作为潜在根本原因候选者的关键变量;第三,提出了基于支持向量机的最优超平面分离候选特征数据,并设计了单对全支持向量机分类器,利用径向基核函数识别出婴儿故障的最终根本原因列表。最后,通过计算机主板故障分析实例验证了所提方法的可行性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信