无监督异常检测的序列集成方法

H. V. Nguyen, Trung-Thanh Nguyen, Nguyen Quang Uy
{"title":"无监督异常检测的序列集成方法","authors":"H. V. Nguyen, Trung-Thanh Nguyen, Nguyen Quang Uy","doi":"10.1109/KSE.2017.8119437","DOIUrl":null,"url":null,"abstract":"In data mining, anomaly detection aims at identifying the observations which do not conform to an expected behavior. To date, a large number of techniques for anomaly detection have been proposed and developed. Recently, researchers have paid their attention to ensemble methods to improve the accuracy of anomaly detection algorithms. Particularly, Sequential Ensemble Method (SEQ) proposed recently has shown significant improvement over other techniques. The idea of SEQ is to evaluate the scores of samples by using a second algorithm with respect to the first algorithm's output. In other words, an algorithm is firstly used to choose a set of the highest suspect abnormal samples (Dref) and then a second algorithm is applied to evaluate the final score of each data samples in the dataset with respect to only Dref. In this paper, we propose an improvement of SEQ by introducing a new way to build Dref that is based on the highest suspect normal samples instead of abnormal samples. The new algorithm is applied to a number of benchmark datasets. The experimental results show that the proposed method provided better and more stable performance compared to the previous version of SEQ and six individual algorithms.","PeriodicalId":159189,"journal":{"name":"2017 9th International Conference on Knowledge and Systems Engineering (KSE)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"Sequential ensemble method for unsupervised anomaly detection\",\"authors\":\"H. V. Nguyen, Trung-Thanh Nguyen, Nguyen Quang Uy\",\"doi\":\"10.1109/KSE.2017.8119437\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In data mining, anomaly detection aims at identifying the observations which do not conform to an expected behavior. To date, a large number of techniques for anomaly detection have been proposed and developed. Recently, researchers have paid their attention to ensemble methods to improve the accuracy of anomaly detection algorithms. Particularly, Sequential Ensemble Method (SEQ) proposed recently has shown significant improvement over other techniques. The idea of SEQ is to evaluate the scores of samples by using a second algorithm with respect to the first algorithm's output. In other words, an algorithm is firstly used to choose a set of the highest suspect abnormal samples (Dref) and then a second algorithm is applied to evaluate the final score of each data samples in the dataset with respect to only Dref. In this paper, we propose an improvement of SEQ by introducing a new way to build Dref that is based on the highest suspect normal samples instead of abnormal samples. The new algorithm is applied to a number of benchmark datasets. The experimental results show that the proposed method provided better and more stable performance compared to the previous version of SEQ and six individual algorithms.\",\"PeriodicalId\":159189,\"journal\":{\"name\":\"2017 9th International Conference on Knowledge and Systems Engineering (KSE)\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 9th International Conference on Knowledge and Systems Engineering (KSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE.2017.8119437\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 9th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE.2017.8119437","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

在数据挖掘中,异常检测的目的是识别不符合预期行为的观测值。迄今为止,已经提出并发展了大量的异常检测技术。近年来,研究人员开始关注集成方法,以提高异常检测算法的准确性。特别是最近提出的序列集成方法(sequence Ensemble Method, SEQ)比其他技术有了显著的改进。SEQ的思想是通过使用第二种算法相对于第一种算法的输出来评估样本的分数。换句话说,首先使用一种算法来选择一组最高可疑异常样本(Dref),然后使用第二种算法来评估数据集中每个数据样本相对于Dref的最终得分。本文提出了一种改进SEQ的方法,即引入一种基于最高可疑正态样本而不是异常样本来构建Dref的新方法。将该算法应用于多个基准数据集。实验结果表明,与以往版本的SEQ和6种单独的算法相比,该方法具有更好的性能和更稳定的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Sequential ensemble method for unsupervised anomaly detection
In data mining, anomaly detection aims at identifying the observations which do not conform to an expected behavior. To date, a large number of techniques for anomaly detection have been proposed and developed. Recently, researchers have paid their attention to ensemble methods to improve the accuracy of anomaly detection algorithms. Particularly, Sequential Ensemble Method (SEQ) proposed recently has shown significant improvement over other techniques. The idea of SEQ is to evaluate the scores of samples by using a second algorithm with respect to the first algorithm's output. In other words, an algorithm is firstly used to choose a set of the highest suspect abnormal samples (Dref) and then a second algorithm is applied to evaluate the final score of each data samples in the dataset with respect to only Dref. In this paper, we propose an improvement of SEQ by introducing a new way to build Dref that is based on the highest suspect normal samples instead of abnormal samples. The new algorithm is applied to a number of benchmark datasets. The experimental results show that the proposed method provided better and more stable performance compared to the previous version of SEQ and six individual algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信