{"title":"Sequential ensemble method for unsupervised anomaly detection","authors":"H. V. Nguyen, Trung-Thanh Nguyen, Nguyen Quang Uy","doi":"10.1109/KSE.2017.8119437","DOIUrl":null,"url":null,"abstract":"In data mining, anomaly detection aims at identifying the observations which do not conform to an expected behavior. To date, a large number of techniques for anomaly detection have been proposed and developed. Recently, researchers have paid their attention to ensemble methods to improve the accuracy of anomaly detection algorithms. Particularly, Sequential Ensemble Method (SEQ) proposed recently has shown significant improvement over other techniques. The idea of SEQ is to evaluate the scores of samples by using a second algorithm with respect to the first algorithm's output. In other words, an algorithm is firstly used to choose a set of the highest suspect abnormal samples (Dref) and then a second algorithm is applied to evaluate the final score of each data samples in the dataset with respect to only Dref. In this paper, we propose an improvement of SEQ by introducing a new way to build Dref that is based on the highest suspect normal samples instead of abnormal samples. The new algorithm is applied to a number of benchmark datasets. The experimental results show that the proposed method provided better and more stable performance compared to the previous version of SEQ and six individual algorithms.","PeriodicalId":159189,"journal":{"name":"2017 9th International Conference on Knowledge and Systems Engineering (KSE)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 9th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2017.8119437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
In data mining, anomaly detection aims at identifying the observations which do not conform to an expected behavior. To date, a large number of techniques for anomaly detection have been proposed and developed. Recently, researchers have paid their attention to ensemble methods to improve the accuracy of anomaly detection algorithms. Particularly, Sequential Ensemble Method (SEQ) proposed recently has shown significant improvement over other techniques. The idea of SEQ is to evaluate the scores of samples by using a second algorithm with respect to the first algorithm's output. In other words, an algorithm is firstly used to choose a set of the highest suspect abnormal samples (Dref) and then a second algorithm is applied to evaluate the final score of each data samples in the dataset with respect to only Dref. In this paper, we propose an improvement of SEQ by introducing a new way to build Dref that is based on the highest suspect normal samples instead of abnormal samples. The new algorithm is applied to a number of benchmark datasets. The experimental results show that the proposed method provided better and more stable performance compared to the previous version of SEQ and six individual algorithms.