{"title":"Ordinal isolation: An efficient and effective intelligent outlier detection algorithm","authors":"Gang Chen, Yuan-li Cai, Juan Shi","doi":"10.1109/CYBER.2011.6011757","DOIUrl":null,"url":null,"abstract":"Outlier detection plays important roles in intelligent cyber systems, especially for fault-tolerant and adaptive ones. Traditional algorithms always need to evaluate distances or densities, which are very time-consuming. On the increasingly urgent demand for real-time, during past years, various novel algorithms have been proposed. They are much faster, but less stable and accurate. To cope with these problems, with the core idea of ordinal optimization and the ‘few and different’ characteristics of outliers, by introducing the concept of outlier probability, we propose the ordinal isolation algorithm, which extracts outliers in terms of the order of being isolated in a recursive uniform data space partition process. It doesn't need any distance or density evaluating, and the complexity is reduced to O(n). Experiments show that, the CPU time of ordinal isolation increases linearly with linearly growing data sets. Furthermore, compared with recent iForest algorithm, ordinal isolation is about 30 times faster, with 20% to 30% improvement in accuracy, and especially is much more stable. Ordinal isolation also has good scalability, so it works well in high-dimensional data sets which have a huge number of instances and irrelevant attributes.","PeriodicalId":131682,"journal":{"name":"2011 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-03-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CYBER.2011.6011757","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 7
Abstract
Outlier detection plays important roles in intelligent cyber systems, especially for fault-tolerant and adaptive ones. Traditional algorithms always need to evaluate distances or densities, which are very time-consuming. On the increasingly urgent demand for real-time, during past years, various novel algorithms have been proposed. They are much faster, but less stable and accurate. To cope with these problems, with the core idea of ordinal optimization and the ‘few and different’ characteristics of outliers, by introducing the concept of outlier probability, we propose the ordinal isolation algorithm, which extracts outliers in terms of the order of being isolated in a recursive uniform data space partition process. It doesn't need any distance or density evaluating, and the complexity is reduced to O(n). Experiments show that, the CPU time of ordinal isolation increases linearly with linearly growing data sets. Furthermore, compared with recent iForest algorithm, ordinal isolation is about 30 times faster, with 20% to 30% improvement in accuracy, and especially is much more stable. Ordinal isolation also has good scalability, so it works well in high-dimensional data sets which have a huge number of instances and irrelevant attributes.