System performance anomaly detection using tracing data analysis

Proceedings of the 2019 5th International Conference on Computer and Technology Applications Pub Date : 2019-04-16 DOI:10.1145/3323933.3324085

Iman Kohyarnejadfard, Mahsa Shakeri, Daniel Aloise

{"title":"System performance anomaly detection using tracing data analysis","authors":"Iman Kohyarnejadfard, Mahsa Shakeri, Daniel Aloise","doi":"10.1145/3323933.3324085","DOIUrl":null,"url":null,"abstract":"In recent years, distributed systems have become increasingly complex as they grow in both scale and functionality. Such complexity makes these systems prone to performance anomalies. Efficient anomaly detection frameworks enable rapid recovery mechanisms to increase the system's reliability. In this paper, we present an anomaly detection approach for practical monitoring of processes running on a system to detect anomalous vectors of system calls. Our proposed methodology employs a Linux tracing toolkit (LTTng) to monitor the processes running on a system and extracts the streams of system calls. The system calls streams are split into short sequences using a sliding window strategy. Unlike previous studies, our proposed approach computes the execution time of system calls in addition to the frequency of each individual call in a window. Finally, a multi-class support vector machine approach is applied to evaluate the performance of the system and detect the anomalous sequences. A comprehensive experimental study on a real dataset collected using LTTng demonstrates that our proposed method is able to distinguish normal sequences from anomalous ones with CPU or memory related problems.","PeriodicalId":137904,"journal":{"name":"Proceedings of the 2019 5th International Conference on Computer and Technology Applications","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2019 5th International Conference on Computer and Technology Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3323933.3324085","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

In recent years, distributed systems have become increasingly complex as they grow in both scale and functionality. Such complexity makes these systems prone to performance anomalies. Efficient anomaly detection frameworks enable rapid recovery mechanisms to increase the system's reliability. In this paper, we present an anomaly detection approach for practical monitoring of processes running on a system to detect anomalous vectors of system calls. Our proposed methodology employs a Linux tracing toolkit (LTTng) to monitor the processes running on a system and extracts the streams of system calls. The system calls streams are split into short sequences using a sliding window strategy. Unlike previous studies, our proposed approach computes the execution time of system calls in addition to the frequency of each individual call in a window. Finally, a multi-class support vector machine approach is applied to evaluate the performance of the system and detect the anomalous sequences. A comprehensive experimental study on a real dataset collected using LTTng demonstrates that our proposed method is able to distinguish normal sequences from anomalous ones with CPU or memory related problems.

查看原文本刊更多论文

系统性能异常检测采用跟踪数据分析

近年来，随着分布式系统在规模和功能上的增长，它们变得越来越复杂。这种复杂性使得这些系统容易出现性能异常。高效的异常检测框架支持快速恢复机制，提高系统的可靠性。在本文中，我们提出了一种异常检测方法，用于实际监控系统上运行的进程，以检测系统调用的异常向量。我们提出的方法使用Linux跟踪工具包(ltng)来监视系统上运行的进程并提取系统调用流。系统调用流使用滑动窗口策略分成短序列。与以前的研究不同，我们提出的方法除了计算窗口中每个单独调用的频率外，还计算系统调用的执行时间。最后，采用多类支持向量机方法对系统性能进行评估，并对异常序列进行检测。在ltng采集的真实数据集上进行的综合实验研究表明，我们提出的方法能够区分正常序列和具有CPU或内存相关问题的异常序列。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2019 5th International Conference on Computer and Technology Applications

自引率

0.00%

发文量