Deovrat Kakde, Sergiy Peredriy, A. Chaudhuri, Anya McGuirk
{"title":"高频多变量数据的非参数控制图","authors":"Deovrat Kakde, Sergiy Peredriy, A. Chaudhuri, Anya McGuirk","doi":"10.1109/RAM.2017.7889786","DOIUrl":null,"url":null,"abstract":"Support Vector Data Description (SVDD) is a machine learning technique used for single class classification and outlier detection. A SVDD based K-chart was first introduced by Sun and Tsung [4]. K-chart provides an attractive alternative to the traditional control charts such as the Hotelling's T2 charts when the distribution of the underlying multivariate data is either non-normal or is unknown. But there are challenges when the K-chart is deployed in practice. The K-chart requires calculating the kernel distance of each new observation but there are no guidelines on how to interpret the kernel distance plot and draw inferences about shifts in process mean or changes in process variation. This limits the application of K-charts in big-data applications such as equipment health monitoring, where observations are generated at a very high frequency. In this scenario, the analyst using the K-chart is inundated with kernel distance results at a very high frequency, generally without any recourse for detecting presence of any assignable causes of variation. We propose a new SVDD based control chart, called a kT chart, which addresses the challenges encountered when using a K-chart for big-data applications. The kT charts can be used to track simultaneously process variation and central tendency.","PeriodicalId":138871,"journal":{"name":"2017 Annual Reliability and Maintainability Symposium (RAMS)","volume":"80 3","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"A non-parametric control chart for high frequency multivariate data\",\"authors\":\"Deovrat Kakde, Sergiy Peredriy, A. Chaudhuri, Anya McGuirk\",\"doi\":\"10.1109/RAM.2017.7889786\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Support Vector Data Description (SVDD) is a machine learning technique used for single class classification and outlier detection. A SVDD based K-chart was first introduced by Sun and Tsung [4]. K-chart provides an attractive alternative to the traditional control charts such as the Hotelling's T2 charts when the distribution of the underlying multivariate data is either non-normal or is unknown. But there are challenges when the K-chart is deployed in practice. The K-chart requires calculating the kernel distance of each new observation but there are no guidelines on how to interpret the kernel distance plot and draw inferences about shifts in process mean or changes in process variation. This limits the application of K-charts in big-data applications such as equipment health monitoring, where observations are generated at a very high frequency. In this scenario, the analyst using the K-chart is inundated with kernel distance results at a very high frequency, generally without any recourse for detecting presence of any assignable causes of variation. We propose a new SVDD based control chart, called a kT chart, which addresses the challenges encountered when using a K-chart for big-data applications. The kT charts can be used to track simultaneously process variation and central tendency.\",\"PeriodicalId\":138871,\"journal\":{\"name\":\"2017 Annual Reliability and Maintainability Symposium (RAMS)\",\"volume\":\"80 3\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 Annual Reliability and Maintainability Symposium (RAMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/RAM.2017.7889786\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Annual Reliability and Maintainability Symposium (RAMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/RAM.2017.7889786","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A non-parametric control chart for high frequency multivariate data
Support Vector Data Description (SVDD) is a machine learning technique used for single class classification and outlier detection. A SVDD based K-chart was first introduced by Sun and Tsung [4]. K-chart provides an attractive alternative to the traditional control charts such as the Hotelling's T2 charts when the distribution of the underlying multivariate data is either non-normal or is unknown. But there are challenges when the K-chart is deployed in practice. The K-chart requires calculating the kernel distance of each new observation but there are no guidelines on how to interpret the kernel distance plot and draw inferences about shifts in process mean or changes in process variation. This limits the application of K-charts in big-data applications such as equipment health monitoring, where observations are generated at a very high frequency. In this scenario, the analyst using the K-chart is inundated with kernel distance results at a very high frequency, generally without any recourse for detecting presence of any assignable causes of variation. We propose a new SVDD based control chart, called a kT chart, which addresses the challenges encountered when using a K-chart for big-data applications. The kT charts can be used to track simultaneously process variation and central tendency.