{"title":"基于聚类的离群点检测方法","authors":"Sheng-yi Jiang, Q. An","doi":"10.1109/FSKD.2008.244","DOIUrl":null,"url":null,"abstract":"Outlier detection is important in many fields. The concept about outlier factor of object is extended to the case of cluster. Based on outlier factor of cluster, a clustering-based outlier detection method, named CBOD, is presented. The method consists of two stages, the first stage cluster dataset by one-pass clustering algorithm and second stage determine outlier cluster by outlier factor. The time complexity of CBOD is nearly linear with the size of dataset and the number of attributes, which results in good scalability and adapts to large dataset. The theoretic analysis and the experimental results show that the detection method is effective and practicable.","PeriodicalId":208332,"journal":{"name":"2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery","volume":"305 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2008-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"97","resultStr":"{\"title\":\"Clustering-Based Outlier Detection Method\",\"authors\":\"Sheng-yi Jiang, Q. An\",\"doi\":\"10.1109/FSKD.2008.244\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Outlier detection is important in many fields. The concept about outlier factor of object is extended to the case of cluster. Based on outlier factor of cluster, a clustering-based outlier detection method, named CBOD, is presented. The method consists of two stages, the first stage cluster dataset by one-pass clustering algorithm and second stage determine outlier cluster by outlier factor. The time complexity of CBOD is nearly linear with the size of dataset and the number of attributes, which results in good scalability and adapts to large dataset. The theoretic analysis and the experimental results show that the detection method is effective and practicable.\",\"PeriodicalId\":208332,\"journal\":{\"name\":\"2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery\",\"volume\":\"305 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2008-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"97\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/FSKD.2008.244\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2008 Fifth International Conference on Fuzzy Systems and Knowledge Discovery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2008.244","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Outlier detection is important in many fields. The concept about outlier factor of object is extended to the case of cluster. Based on outlier factor of cluster, a clustering-based outlier detection method, named CBOD, is presented. The method consists of two stages, the first stage cluster dataset by one-pass clustering algorithm and second stage determine outlier cluster by outlier factor. The time complexity of CBOD is nearly linear with the size of dataset and the number of attributes, which results in good scalability and adapts to large dataset. The theoretic analysis and the experimental results show that the detection method is effective and practicable.