Juntao Yang , Lijun Yang , Dongming Tang , Tao Liu
{"title":"Escape velocity-based adaptive outlier detection algorithm","authors":"Juntao Yang , Lijun Yang , Dongming Tang , Tao Liu","doi":"10.1016/j.knosys.2025.113116","DOIUrl":null,"url":null,"abstract":"<div><div>Outlier detection is a pivotal technique within the realm of data mining, serving to pinpoint aberrant values nestled within datasets. It has been widely employed across diverse domains, including detection of credit card frauds, identification of seismic activities, and identification of anomalies within image datasets. However, existing approaches still face three shortcomings: (1) they often struggle with the intricacies of parameter selection and the vexing top-n dilemma, (2) they lack in their capacity to discern local outliers, and (3) their algorithmic efficacies markedly wane as datasets burgeon in sample point size and outlier prevalence. In addressing these formidable hurdles, we propose a novel, <strong>E</strong>scape <strong>V</strong>elocity-based adaptive <strong>O</strong>utlier <strong>D</strong>etection algorithm, noted as EVOD. The EVOD algorithm calculates the escape velocity of each data sample point and automatically detects the number of outliers by monitoring peak fluctuations in the growth rate of escape velocities of sample points, thereby solving the top-n problem suffered by existing outlier detection algorithms. Experimental results demonstrate that our algorithm, without requiring manual adjustment of parameters, can simultaneously detect global outliers, local outliers, and outlier clusters. In addition, it maintains a good performance even as the number of sample points and outliers in the dataset increases, particularly for complex manifold datasets.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"311 ","pages":"Article 113116"},"PeriodicalIF":7.2000,"publicationDate":"2025-02-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125001637","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Outlier detection is a pivotal technique within the realm of data mining, serving to pinpoint aberrant values nestled within datasets. It has been widely employed across diverse domains, including detection of credit card frauds, identification of seismic activities, and identification of anomalies within image datasets. However, existing approaches still face three shortcomings: (1) they often struggle with the intricacies of parameter selection and the vexing top-n dilemma, (2) they lack in their capacity to discern local outliers, and (3) their algorithmic efficacies markedly wane as datasets burgeon in sample point size and outlier prevalence. In addressing these formidable hurdles, we propose a novel, Escape Velocity-based adaptive Outlier Detection algorithm, noted as EVOD. The EVOD algorithm calculates the escape velocity of each data sample point and automatically detects the number of outliers by monitoring peak fluctuations in the growth rate of escape velocities of sample points, thereby solving the top-n problem suffered by existing outlier detection algorithms. Experimental results demonstrate that our algorithm, without requiring manual adjustment of parameters, can simultaneously detect global outliers, local outliers, and outlier clusters. In addition, it maintains a good performance even as the number of sample points and outliers in the dataset increases, particularly for complex manifold datasets.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.