{"title":"多变量离群标记的鲁棒峰度投影","authors":"D. Herwindiati, Rahmat Sagara, J. Hendryli","doi":"10.1109/ICACSIS.2015.7415151","DOIUrl":null,"url":null,"abstract":"Outlier labeling can be considered as an early procedure to get the information of `suspects'. This paper introducesrobust kurtosis projection algorithm for multivariate outlier labeling of data set with moderate, high and very high percentage outlier. The algorithm works in two stages. In the first stage, we propose a projection approach to findthe orthonormal set of all vectors that maximize the kurtosis of the projected standardized data. In the second stage, we estimate robust covariance matrix minimizing vector variance to label high dimensional outliers. In this stage, we use the robust estimator on the lower-dimensional data space to identify the suspected anomolous observations. The simulation experiments reveal that theintroduced algorithm has a good performance to identify an anomalous observation hidden in a moderate, high, and very high percentage of contamination data and it seems to work well in data analysis.","PeriodicalId":325539,"journal":{"name":"2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)","volume":"89 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Robust kurtosis projection for multivariate outlier labeling\",\"authors\":\"D. Herwindiati, Rahmat Sagara, J. Hendryli\",\"doi\":\"10.1109/ICACSIS.2015.7415151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Outlier labeling can be considered as an early procedure to get the information of `suspects'. This paper introducesrobust kurtosis projection algorithm for multivariate outlier labeling of data set with moderate, high and very high percentage outlier. The algorithm works in two stages. In the first stage, we propose a projection approach to findthe orthonormal set of all vectors that maximize the kurtosis of the projected standardized data. In the second stage, we estimate robust covariance matrix minimizing vector variance to label high dimensional outliers. In this stage, we use the robust estimator on the lower-dimensional data space to identify the suspected anomolous observations. The simulation experiments reveal that theintroduced algorithm has a good performance to identify an anomalous observation hidden in a moderate, high, and very high percentage of contamination data and it seems to work well in data analysis.\",\"PeriodicalId\":325539,\"journal\":{\"name\":\"2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)\",\"volume\":\"89 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICACSIS.2015.7415151\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICACSIS.2015.7415151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Robust kurtosis projection for multivariate outlier labeling
Outlier labeling can be considered as an early procedure to get the information of `suspects'. This paper introducesrobust kurtosis projection algorithm for multivariate outlier labeling of data set with moderate, high and very high percentage outlier. The algorithm works in two stages. In the first stage, we propose a projection approach to findthe orthonormal set of all vectors that maximize the kurtosis of the projected standardized data. In the second stage, we estimate robust covariance matrix minimizing vector variance to label high dimensional outliers. In this stage, we use the robust estimator on the lower-dimensional data space to identify the suspected anomolous observations. The simulation experiments reveal that theintroduced algorithm has a good performance to identify an anomalous observation hidden in a moderate, high, and very high percentage of contamination data and it seems to work well in data analysis.