Robert K. L. Kennedy, Zahra Salekshahrezaee, T. Khoshgoftaar
{"title":"基于迭代清洗方法的类不平衡认知数据无监督异常检测","authors":"Robert K. L. Kennedy, Zahra Salekshahrezaee, T. Khoshgoftaar","doi":"10.1109/IRI58017.2023.00060","DOIUrl":null,"url":null,"abstract":"The presence of class imbalance in machine learning datasets is a pervasive challenge that often hampers the effectiveness of traditional machine learning models. In the context of anomaly detection, the instances in the minority class are the ones of most interest. To address this issue, we evaluate an unsupervised approach that uses an iterative cleaning process for anomaly detection on cognition data. We conduct experiments on two cognition datasets, one has a large degree of class imbalance and the other is balanced. Our findings show that the unsupervised iterative cleaning approach outperforms two other unsupervised models, namely Isolation Forest and Copula-Based Outlier Detector, in the class-imbalanced dataset. The approach does not outperform both the other two models on the balanced dataset, making the approach presented particularly well-suited when there is a large class imbalance in cognition data.","PeriodicalId":290818,"journal":{"name":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","volume":"33 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unsupervised Anomaly Detection of Class Imbalanced Cognition Data Using an Iterative Cleaning Method\",\"authors\":\"Robert K. L. Kennedy, Zahra Salekshahrezaee, T. Khoshgoftaar\",\"doi\":\"10.1109/IRI58017.2023.00060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The presence of class imbalance in machine learning datasets is a pervasive challenge that often hampers the effectiveness of traditional machine learning models. In the context of anomaly detection, the instances in the minority class are the ones of most interest. To address this issue, we evaluate an unsupervised approach that uses an iterative cleaning process for anomaly detection on cognition data. We conduct experiments on two cognition datasets, one has a large degree of class imbalance and the other is balanced. Our findings show that the unsupervised iterative cleaning approach outperforms two other unsupervised models, namely Isolation Forest and Copula-Based Outlier Detector, in the class-imbalanced dataset. The approach does not outperform both the other two models on the balanced dataset, making the approach presented particularly well-suited when there is a large class imbalance in cognition data.\",\"PeriodicalId\":290818,\"journal\":{\"name\":\"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)\",\"volume\":\"33 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRI58017.2023.00060\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE 24th International Conference on Information Reuse and Integration for Data Science (IRI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRI58017.2023.00060","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Unsupervised Anomaly Detection of Class Imbalanced Cognition Data Using an Iterative Cleaning Method
The presence of class imbalance in machine learning datasets is a pervasive challenge that often hampers the effectiveness of traditional machine learning models. In the context of anomaly detection, the instances in the minority class are the ones of most interest. To address this issue, we evaluate an unsupervised approach that uses an iterative cleaning process for anomaly detection on cognition data. We conduct experiments on two cognition datasets, one has a large degree of class imbalance and the other is balanced. Our findings show that the unsupervised iterative cleaning approach outperforms two other unsupervised models, namely Isolation Forest and Copula-Based Outlier Detector, in the class-imbalanced dataset. The approach does not outperform both the other two models on the balanced dataset, making the approach presented particularly well-suited when there is a large class imbalance in cognition data.