{"title":"Online Outlier Detection of Energy Data Streams Using Incremental and Kernel PCA Algorithms","authors":"Jeremiah D. Deng","doi":"10.1109/ICDMW.2016.0062","DOIUrl":null,"url":null,"abstract":"Outlier detection or anomaly detection is an important and challenging issue in data mining, even so in the domain of energy data mining where data are often collected in large amounts but with little labeled information. This paper presents a couple of online outlier detection algorithms based on principal component analysis. Novel algorithmic treatments are introduced to build incremental PCA and kernel PCA algorithms with online learning abilities. Some preliminary experimental results obtained from a real-world household consumption dataset have produced some promising performance for the proposed algorithms.","PeriodicalId":373866,"journal":{"name":"2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)","volume":"204 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 IEEE 16th International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2016.0062","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 6
Abstract
Outlier detection or anomaly detection is an important and challenging issue in data mining, even so in the domain of energy data mining where data are often collected in large amounts but with little labeled information. This paper presents a couple of online outlier detection algorithms based on principal component analysis. Novel algorithmic treatments are introduced to build incremental PCA and kernel PCA algorithms with online learning abilities. Some preliminary experimental results obtained from a real-world household consumption dataset have produced some promising performance for the proposed algorithms.