{"title":"基于分数距离度量和离群度的模糊数据集离群点检测改进定位方法","authors":"Mehdi Hajiloei, A. F. Jahromi, Somayeh Zolmani","doi":"10.3233/jifs-234448","DOIUrl":null,"url":null,"abstract":"Density based methods are significant approaches in outlier detection for high dimensional datasets and Local correlation integral (LOCI) is one of the best of them. To extend LOCI for fuzzy datasets, we should employ suitable metrics to measure the distance between two fuzzy numbers. Euclidean distance measure is a classic one in metric learning, but to overcome curse of dimensionality, we apply fractional distance metric too. Then, after introducing the FLOCI outlier detection algorithm for identifying the fuzzy outliers, we study the efficiency of the proposed method by doing some numerical experiments, in which the obtained results were completely successfull. We also compared the results with Fuzzy versions of Distance based ABOD and SOD methods to prove robustness of this approache. More than the above, one of the main advantages of the new approach is the determination of outlierness factor for each data which is not presented in classical LOCI method.","PeriodicalId":509313,"journal":{"name":"Journal of Intelligent & Fuzzy Systems","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An improved loci method for outlier detection in fuzzy datasets based on fractional distance metric and outlierness degree\",\"authors\":\"Mehdi Hajiloei, A. F. Jahromi, Somayeh Zolmani\",\"doi\":\"10.3233/jifs-234448\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Density based methods are significant approaches in outlier detection for high dimensional datasets and Local correlation integral (LOCI) is one of the best of them. To extend LOCI for fuzzy datasets, we should employ suitable metrics to measure the distance between two fuzzy numbers. Euclidean distance measure is a classic one in metric learning, but to overcome curse of dimensionality, we apply fractional distance metric too. Then, after introducing the FLOCI outlier detection algorithm for identifying the fuzzy outliers, we study the efficiency of the proposed method by doing some numerical experiments, in which the obtained results were completely successfull. We also compared the results with Fuzzy versions of Distance based ABOD and SOD methods to prove robustness of this approache. More than the above, one of the main advantages of the new approach is the determination of outlierness factor for each data which is not presented in classical LOCI method.\",\"PeriodicalId\":509313,\"journal\":{\"name\":\"Journal of Intelligent & Fuzzy Systems\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-03-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Intelligent & Fuzzy Systems\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/jifs-234448\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Intelligent & Fuzzy Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/jifs-234448","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
基于密度的方法是高维数据集离群点检测的重要方法,而局部相关积分(LOCI)是其中最好的方法之一。要将局部相关积分扩展到模糊数据集,我们应该采用合适的度量方法来测量两个模糊数之间的距离。欧氏距离度量是度量学习中的经典度量,但为了克服维度诅咒,我们也采用了分数距离度量。然后,在介绍了用于识别模糊离群值的 FLOCI 离群值检测算法后,我们通过一些数值实验研究了所提方法的效率,实验结果完全正确。我们还将结果与基于距离的模糊 ABOD 和 SOD 方法进行了比较,以证明这种方法的稳健性。除上述优点外,新方法的主要优点之一是可以确定每个数据的离群因子,而这是经典 LOCI 方法所不具备的。
An improved loci method for outlier detection in fuzzy datasets based on fractional distance metric and outlierness degree
Density based methods are significant approaches in outlier detection for high dimensional datasets and Local correlation integral (LOCI) is one of the best of them. To extend LOCI for fuzzy datasets, we should employ suitable metrics to measure the distance between two fuzzy numbers. Euclidean distance measure is a classic one in metric learning, but to overcome curse of dimensionality, we apply fractional distance metric too. Then, after introducing the FLOCI outlier detection algorithm for identifying the fuzzy outliers, we study the efficiency of the proposed method by doing some numerical experiments, in which the obtained results were completely successfull. We also compared the results with Fuzzy versions of Distance based ABOD and SOD methods to prove robustness of this approache. More than the above, one of the main advantages of the new approach is the determination of outlierness factor for each data which is not presented in classical LOCI method.