Clustering-Based Outlier Detection Method

Sheng-yi Jiang, Q. An
{"title":"Clustering-Based Outlier Detection Method","authors":"Sheng-yi Jiang, Q. An","doi":"10.1109/FSKD.2008.244","DOIUrl":null,"url":null,"abstract":"Outlier detection is important in many fields. The concept about outlier factor of object is extended to the case of cluster. Based on outlier factor of cluster, a clustering-based outlier detection method, named CBOD, is presented. The method consists of two stages, the first stage cluster dataset by one-pass clustering algorithm and second stage determine outlier cluster by outlier factor. The time complexity of CBOD is nearly linear with the size of dataset and the number of attributes, which results in good scalability and adapts to large dataset. The theoretic analysis and the experimental results show that the detection method is effective and practicable.","PeriodicalId":208332,"journal":{"name":"2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery","volume":"305 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"97","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2008.244","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 97

Abstract

Outlier detection is important in many fields. The concept about outlier factor of object is extended to the case of cluster. Based on outlier factor of cluster, a clustering-based outlier detection method, named CBOD, is presented. The method consists of two stages, the first stage cluster dataset by one-pass clustering algorithm and second stage determine outlier cluster by outlier factor. The time complexity of CBOD is nearly linear with the size of dataset and the number of attributes, which results in good scalability and adapts to large dataset. The theoretic analysis and the experimental results show that the detection method is effective and practicable.
基于聚类的离群点检测方法
异常值检测在许多领域都很重要。将目标离群因子的概念推广到聚类的情况。基于聚类的离群因子,提出了一种基于聚类的离群因子检测方法——CBOD。该方法分为两个阶段,第一阶段通过一次聚类算法对数据集进行聚类,第二阶段通过离群因子确定离群点聚类。CBOD的时间复杂度与数据集的大小和属性的数量呈近似线性关系,具有良好的可扩展性,适合于大型数据集。理论分析和实验结果表明,该检测方法是有效可行的。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信